THUDM / LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
MIT License
633 stars, 45 forks

How to evaluate on llama3-8b-instruct? #71

Open txchen-USTC opened 1 month ago

txchen-USTC commented 1 month ago

How can I evaluate llama3-8b-instruct on LongBench? Could you please add support for this model? Thanks!