princeton-nlp / LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
MIT License
378 stars 37 forks source link

Can you provide a validation dataset? #35

Open kuang1216 opened 1 month ago

kuang1216 commented 1 month ago

Thank you very much for your work. I see that the TyDi QA dataset only contains the following data. image However, in the run_eval.py file under the TyDiQA directory, image It appears that No such file or directory: '/LESS/data/eval/tydiqa/dev/tydiqa-goldp-v1.1-train.json,Is it an issue with the dataset I downloaded or the code?