Aug 20, 2025 06:42 PM (GMT+8) · EqualOcean
China is the first country to explicitly classify data as a factor of production. As of the end of June this year, China has built over 35,000 high-quality datasets, with a total volume equivalent to approximately 140 times that of the digital resources of the National Library of China, providing a data foundation for AI training. Currently, the proportion of Chinese-language data used in the training of most domestic AI models has exceeded 60%, and some models have reached 80%. The development and supply capacity of high-quality Chinese-language data continues to strengthen, driving significant progress in the performance of domestic AI models.

Source: