CMMLU

Total 2 articles 网址

Sorting

CMMLU

CMMLU is an evaluation benchmark designed for the Chinese context, covering 67 topics to comprehensively test language models' knowledge and reasoning abilities, with a particular emphasis on China-specific knowledge areas.

903,22095.1K

Model Evaluation # China-specific knowledge # Chinese evaluation benchmark # CMMLU

CMMLU

CMMLU是一个专为中文语境设计的综合性评估基准，涵盖67个主题，旨在全面测试语言模型的知识储备和推理能力。

894,58530.8K

Model Evaluation # AI模型评测 # CMMLU # 中文评估基准