princeton-nlp / HELMET

The HELMET Benchmark
https://arxiv.org/abs/2410.02694
MIT License
51 stars 7 forks source link

Is the code of eval_gpt4_longqa.sh is correct? #4

Closed enze5088 closed 1 week ago

enze5088 commented 1 week ago

The code in it seems to be Python code rather than a shell script.

howard-yen commented 1 week ago

Thanks for catching this, it's been fixed!