130 Million - Chinese Test Question Texts from Elementary School to University Parsing And Processing Data
2.4 Million Pairs Image Caption Data Of General Scenes
700,000 Sets Image Caption Data Of General Scenes
20,846 Groups Image Caption Data of Cookbook
Japanese OKWAVE Q&A platform Text Parsing and Processing Data
1 million - Chinese Code Questions Text Parsing And Processing Data
25,000 People - Multiple Styles Video Data
100,000 Instruction-Following Evaluation SFT for Chinese LLM Text Data
6.03 Million - Majors Questions Text Parsing And Processing Data
2.4 million - Korean Test Questions Structured Analysis Processing Data
32 million - Science Subjects Questions Text Parsing And Processing Data
300 Million Pairs - High-Quality Image-Caption Dataset
7 Million Sets - High-Quality Video Caption Dataset
100,145 Sets of ICONS Image Caption Data
100,000 Fine-Tuning text data set for English LLM General Domain SFT