
FlagEval
FlagEval (Libra) is a large model evaluation platform developed by BAAI in collaboration with multiple university teams. It employs a 'Capability-Task-Metric' three-dimensional evaluation framework to provide comprehensive and detailed assessment results, aiding researchers and developers in gaining deep insights into model performance.