
In today’s rapidly developing AI era, comprehensively and objectively evaluating the performance of Chinese general large models has become a focus for developers and researchers. SuperCLUE emerges as a comprehensive benchmark for Chinese general large models, providing authoritative references for model capability assessment.
Website Introduction
SuperCLUE was launched by the CLUE academic community in May 2023, aiming to comprehensively evaluate the performance of Chinese large models through a multi-dimensional evaluation system, helping developers and researchers understand the strengths and weaknesses of models.
Key Features
- Basic Abilities: Including 10 capabilities such as semantic understanding, dialogue, logical reasoning, role-playing, code generation, and creation.
- Professional Skills: Involving middle school, university, and professional exams, covering more than 50 capabilities in mathematics, physics, geography, social sciences, etc.
- Chinese-Specific Features: Targeting Chinese-specific tasks, including 10 capabilities such as idioms, poetry, literature, and character forms.
Related Projects
- SuperCLUE-OPEN: Multi-turn open-ended question evaluation, assessing model performance in open dialogues.
- SuperCLUE-OPT: Objective question closed-book test, based on over 3,700 multiple-choice questions, evaluating the objective performance of models in basic, Chinese, and professional abilities.
- Langya List Anonymous Battle: Through an anonymous battle mechanism, dynamically tracking the performance differences of mainstream models domestically and internationally.
Advantages
SuperCLUE’s multi-dimensional evaluation system enables developers to comprehensively understand model performance and optimize accordingly. Additionally, the platform updates rankings monthly, keeping up with model development dynamics and providing users with the latest evaluation results.
Pricing
SuperCLUE is currently open for free, and users can access relevant information and resources through the official website or GitHub project address.
Summary
The CLUE academic community was established in 2023, located in China, dedicated to providing a comprehensive evaluation benchmark for Chinese large models. Through SuperCLUE’s innovative features, users can obtain comprehensive and objective model evaluation results, assisting in model development and optimization.
Relevant Navigation


HELM

Chatbot Arena

Stable Chat

Stable Chat

OpenCompass

MMLU
