princeton-nlp / HELMET

The HELMET Benchmark
https://arxiv.org/abs/2410.02694
MIT License

Making the full eval sheet a read-only excel #3

Closed LeoXinhaoLee closed 1 week ago

LeoXinhaoLee commented 1 week ago

Thank you so much for releasing this wonderful benchmark and eval results on a lot of models!

I'm wondering if it is possible to make the eval result sheet a read-only Excel sheet instead of HTML, which would make it much easier to copy the eval numbers.

Many thanks!

howard-yen commented 1 week ago

Thanks for your interest! I have updated the link in the README to a read-only Google Sheet. Please let me know if you have any other questions!