LBB.AI Toolbox (Link, Build, Beyond)
  • Home
  • Submission
  • Ranking
  • About
    • Home
    • Submission
    • Ranking
    • About

    Language Model Evaluation

    Total 3 articles 网址
    Sorting
    releaseViews
    C-Eval

    C-Eval

    C-Eval is a Chinese foundational model evaluation suite jointly developed by Shanghai Jiao Tong University, Tsinghua University, and the University of Edinburgh. It comprises 13,948 multiple-choice questions across 52 disciplines and four difficulty levels, aiming to comprehensively assess large language models' Chinese comprehension and reasoning abilities.
    882,22595.1K
    Model Evaluation# AI Assessment# AI evaluation tool# C-Eval
    MMLU

    MMLU

    MMLU (Massive Multitask Language Understanding) is a benchmark test launched by the University of California, Berkeley in September 2020, aiming to comprehensively evaluate large language models' multitask understanding across 57 different domains.
    882,28595.1K
    Model Evaluation# AI Benchmark# AI Model Assessment# Language Model Evaluation
    HELM

    HELM

    HELM (Holistic Evaluation of Language Models) is a comprehensive evaluation system for language models introduced by Stanford University, aiming to assess the performance and characteristics of language models through standardized datasets, unified model interfaces, and multidimensional evaluation metrics.
    882,25095.1K
    Model Evaluation# AI Assessment# HELM# Language Model Evaluation
    No more

    标签云

    LBB.AI Toolbox (Link, Build, Beyond)
    Explore 1,100+ AI Tools on LBB.AI Toolbox (Link, Build, Beyond) – your smart platform for global AI resources. Powered by LBBAI 1.0 intelligent ranking (launched May 2025). Get daily updates on the best tools to automate tasks and stay ahead in the AI revolution.

    Apply for Link Submission About

    wechat QR CodeLBB.AI Toolbox (Link, Build, Beyond)
    wechat QR Code
    Copyright © 2025 LBB.AI (Link, Build, Beyond)   
    网址
    网址文章
    热门搜索
    PPTexcelPhotoRestorationPapervideoHumanPhoto