
10,000 Sets - Digital Chart Q&A Data

30 Million High-quality Video Data

80 Million Vector Image Data

200 Million High-quality Image Data

500,000 Images - Natural Scenes and Documents OCR Data

30,000 Images - Natural Scenes OCR Data in Southeast Asian Languages

100,000 Sets of ICONS Image Caption Data

6.9 million - Chinese Multi-disciplinary Questions Text Parsing And Processing Data

1 million - Chinese Code Questions Text Parsing And Processing Data

161 Hours - Gujarati(India) Scripted Monologue speech dataset

32 million - Science Subjects Questions Text Parsing And Processing Data

114,000 - Chinese Contest Questions Text Parsing And Processing Data

1500 Hours - French(Canada) Real-world Casual Conversation and Monologue speech dataset

5,000 Images of Turkish Natural Scene OCR Data

155 Hours - French(Canada) Spontaneous Dialogue Smartphone speech dataset

900 Hours - Thai(Thailand) Real-world Casual Conversation and Monologue speech dataset

2000 Hours - English(Australia) Real-world Casual Conversation and Monologue speech dataset

20,846 Groups Image Caption Data of Cookbook

1.5 million - Korean Test Questions Structured Analysis Processing Data

1528 Hours - Gujarati(India) Real-world Casual Conversation and Monologue speech dataset
. . .