There are some minor mistakes in the dataset

OpenLMLab / LEval

[ACL'24 Oral] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

GNU General Public License v3.0

Hi! Firstly, I want to express my appreciation for the great work on the dataset designed for the long text challenge; it's a valuable resource, and I'm really glad to see it.

However, the dataset contains a few minor mistakes. For example, in the first sample of load_dataset('L4NLP/LEval', "coursera", split='test'), Question 4 occurs twice. The duplication is small, but in my tests it can affect the experimental results, so I want to point it out~
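In case it helps with checking the other samples, here is a minimal sketch of how repeated questions could be detected. The helper name `find_duplicate_questions` and the toy question list are hypothetical and only for illustration; they are not taken from the dataset or the L-Eval code.

```python
from collections import defaultdict


def find_duplicate_questions(questions):
    """Return {normalized question: [indices]} for questions that appear more than once."""
    positions = defaultdict(list)
    for index, question in enumerate(questions):
        # Collapse runs of whitespace so trivially different copies still match.
        key = " ".join(question.split())
        positions[key].append(index)
    return {text: idxs for text, idxs in positions.items() if len(idxs) > 1}


# Toy stand-in for one sample's question list (not real dataset content).
toy_questions = [
    "Question 1. A?",
    "Question 4. B?",
    "Question 4.  B?",
    "Question 5. C?",
]
duplicates = find_duplicate_questions(toy_questions)
print(duplicates)
```

To run this on the real data, one could pass the question list of the first sample returned by the load_dataset call above. I am assuming the questions of a sample are stored as a list of strings under a field such as "instructions"; please adjust the field name if the schema differs.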

Looking forward to your update. Have a good day~

There are some minor mistakes in the dataset #1