booydar / babilong

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
Apache License 2.0
149 stars, 16 forks

Leaderboard is down. Please restart the leaderboard on Hugging Face, thanks. #5

Status: Open. ZetangForward opened 1 day ago

yurakuratov commented 1 day ago

Thanks! It should work now.