Snowflake-Labs / snowflake-arctic

Apache License 2.0

Detailed Accuracy Metrics for Individual Benchmarks #21

Open taesiri opened 4 months ago

taesiri commented 4 months ago

Hi,

First of all, thank you so much for releasing this awesome model.

We want to include Snowflake-Arctic in our study, and we need detailed accuracy metrics for individual benchmarks. The official article says that the "Common Sense" metric is an average of 11 metrics, and the "Coding" category also appears to be an average of HumanEval and MBPP. Could you provide the detailed scores for these individual benchmarks?

Thanks!