CMMLU CMMLU is an evaluation benchmark designed for the Chinese context, covering 67 topics to comprehensively test language models' knowledge and reasoning abilities, with a particular emphasis on China-specific knowledge areas. 880,88095.1K Model Evaluation# China-specific knowledge# Chinese evaluation benchmark# CMMLU