
Power of Chinese AI Models



Introduction

After the market turmoil triggered by DeepSeek R1, attention has shifted towards China. The West is now looking towards the East, and even those in the East are turning their gaze northward.

I have been tracking these models for some time, so I thought I would summarize them in one place for my readers.

Open source: 🚀

Partially or fully closed source: 🔒

List of Chinese Models

| Developer | Model Series | Models | Features |
| --- | --- | --- | --- |
| Tsinghua & Fudan University | OpenChineseGPT | OpenChineseGPT 🚀 | Dialogue, instruction-following |
| Tsinghua & Fudan University | OpenBuddy | OpenBuddy 🚀 | Dialogue, instruction-following |
| Tsinghua & Fudan University | OpenChineseLLaMA | OpenChineseLLaMA 🚀 | Dialogue, instruction-following |
| Shanghai AI Lab | Fengshenbang Series | Fengshenbang-13B 🚀, Fengshenbang-7B 🚀 | General-purpose, multilingual |
| IDEA Research | Ziya Series | Ziya-LLaMA 🚀, Ziya-13B 🚀 | Dialogue, instruction-following |
| Tsinghua University | CPM Series | CPM-1 🚀, CPM-2 🚀, CPM-3 🚀 | Early Chinese LLMs |
| Huawei | PanGu | PanGu 🔒 | Large-scale, multilingual |
| Tsinghua & Fudan University | Chinese LLaMA & Alpaca | Chinese LLaMA 🚀, Chinese Alpaca 🚀 | Dialogue, instruction-following |
| Fudan University | MOSS | MOSS 🚀 | Dialogue, general-purpose |
| Zhipu AI | ChatGLM Series | ChatGLM3 🚀, ChatGLM2 🚀, ChatGLM 🚀, GLM-4 🚀 | Chinese dialogue, multi-turn, long-context |
| Alibaba Cloud | Qwen Series | Qwen-1.8B 🚀, Qwen-7B 🚀, Qwen-14B 🚀, Qwen-72B 🚀, Qwen-2.5-1M 🚀 | Multimodal, multilingual, 32K tokens, strong performance on benchmarks |
| Baichuan Intelligent Tech | Baichuan Series | Baichuan-7B 🚀, Baichuan-13B 🚀, Baichuan2 🚀 | High performance, quantized versions |
| Shanghai AI Lab | InternLM Series | InternLM 🚀, InternLM-Chat 🚀 | General-purpose, long-context |
| 01.AI | Yi Series | Yi-1.0 🚀, Yi-6B 🚀, Yi-34B 🚀 | Multilingual, long-context |
| DeepSeek AI | DeepSeek Series | DeepSeek-V2 🚀, DeepSeek-LLM-67B 🚀, DeepSeek-R1 🚀 | High performance, Chinese & English, advanced reasoning for math and coding |
| Shenzhen Yuanxiang AI | XVERSE Series | XVERSE-7B 🚀, XVERSE-13B 🚀, XVERSE-65B 🚀 | Multilingual, 256K tokens |
| Peking University | YuLan Series | YuLan-Base-126B 🚀, YuLan-Chat-3-126B 🚀 | Multilingual, large-scale pretraining |
| Sichuan AI University | gLAW | LAW 🚀, LAWMiner 🚀, LLAMA 🚀, Fuzz 🚀, Mingcha 🚀 | Specialized for legal tasks |
| Baidu | ERNIE | ERNIE 3.0 Titan 🔒 | Knowledge-enhanced, 260 billion parameters, supports multiple industries |
| ByteDance | Doubao | Doubao 1.5 Pro 🔒 | Claimed to beat GPT-4o in knowledge retention, coding, and reasoning; optimized for lower hardware costs |
| Tencent | Hunyuan | Hunyuan 🔒 | Supports image and text generation, logical reasoning; aimed at enterprise use |
| Moonshot AI | Kimi | Kimi k1.5 🔒 | Claimed to match or outperform OpenAI o1; focused on solving complex problems |
| SenseTime | SenseNova | SenseNova 🔒 | Includes models for natural language processing, content generation, data annotation |
| MiniMax | MiniMax-Text | MiniMax-Text-01 🔒 | Large parameter count (456 billion), leads on some benchmarks, large context window |
| Kuaishou | Kling | Kling 🔒 | Text-to-video model, free to the public, simulates real-world motion and physics |
| iFlytek | iFlytek Spark | iFlytek Spark V4.0 🔒 | Improved core capabilities; ranks high in international tests compared to GPT-4 Turbo |
