CogVLM-SFT-311K: Bilingual Visual Instruction Data in CogVLM SFT
CogVLM-SFT-311K is the primary aligned corpus used in the initial training of CogVLM v1.0. The process of constructing this dataset is as follows:
Explore
138,945 skills indexed with the new KISS metadata standard.
CogVLM-SFT-311K is the primary aligned corpus used in the initial training of CogVLM v1.0. The process of constructing this dataset is as follows:
📗 README in English
CogVLM-SFT-311K 是我们在训练 CogVLM v1.0 最初版本时使用的主要对齐语料库。此数据集的构建过程如下:
LOCALWORLDSIZE=8
pycache
build:
* text=auto
pycache
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities.
[*]
.git
We are committed to providing a welcoming and inclusive environment for all people, regardless of age, body size, caste, disability, ethnicity, gender identity and expression, level of experience, fam...
site_description: Evaluation framework for your AI Application
<img style="vertical-align:middle" height="200"
This comprehensive guide covers development workflows for the Ragas monorepo, designed for both human developers and AI agents.
We take the security of RAGAS seriously. If you discover a security vulnerability in this project, please report it to us privately. Do not report security vulnerabilities through public GitHub issues...
INHERIT: ./mkdocs.yml
repos:
.DS_Store
No description available.
mkdocs:
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
test_resources