THUDM / LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
MIT License
675 stars 54 forks source link

Any Implementation of Mistral-7B? #54

Open leeyeehoo opened 9 months ago

leeyeehoo commented 9 months ago

Hi, do you report the Mistral-7B results? Thank you!

bys0318 commented 9 months ago

Hi, we haven't officially evaluated Mistral-7B on LongBench. But I have seen this paper carried out the evaluation.