evalplus / repoqa

RepoQA: Evaluating Long-Context Code Understanding
https://evalplus.github.io/repoqa.html
Apache License 2.0
99 stars 3 forks source link

feat(compute): fetch train size #33

Closed JialeTomTian closed 6 months ago

JialeTomTian commented 6 months ago

Code to fetch train size, hard coded for paid models.

I am currently waiting for repo approval for Llama 3 LOL. I will update the release scores files after I get approval.

ganler commented 6 months ago

That is smart.

JialeTomTian commented 6 months ago

Updated the release with train_size