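CMMLU
CMMLU is a comprehensive evaluation benchmark designed specifically for the Chinese-language context. It covers 67 topics and aims to test language models' knowledge and reasoning abilities thoroughly.
Model Evaluation #AI model evaluation #CMMLU #Chinese evaluation benchmark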
CMMLU
CMMLU is an evaluation benchmark designed for the Chinese context. It covers 67 topics to comprehensively test language models' knowledge and reasoning abilities, with a particular emphasis on China-specific knowledge areas.
Model Evaluation #China-specific knowledge #Chinese evaluation benchmark #CMMLU