flageval group tries to add the clcc dataset to moonshot-data.

aiverify-foundation / moonshot-data

Contains all assets to run with Moonshot Library (Connectors, Datasets and Metrics)

Apache License 2.0

11 stars 10 forks source link

flageval group tries to add the clcc dataset to moonshot-data. #25

Closed eyuansu62 closed 2 months ago

eyuansu62 commented 3 months ago

CLCC (Chinese Linguistics & Cognition Challenge) is a Chinese evaluation dataset from the FlagEval team at BAAI (Beijing Academy of Artificial Intelligence), which its results are judged by an judge model. This judge model, FlagJudge, is based on Llama-33B and trained on the CLCC dataset with 40,000 manually labeled examples.