THUDM / LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
MIT License

Include data on which passage contains answer #60

Open danielmisrael opened 6 months ago

danielmisrael commented 6 months ago

For the Multi-Document QA tasks, is there a simple way to know which passages a given answer is drawn from? It would be very helpful to have a field called 'retrieval_indices' that contains the indices of the passages used to obtain the answer. Thank you!

bys0318 commented 6 months ago

Thanks for your suggestion! We will consider adding this feature to our dataset.
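
Until such a field exists, one rough workaround is to approximate it by string matching. The sketch below is not part of LongBench. It assumes each sample has a `context` string and an `answers` list, and that passages inside `context` are introduced by header lines of the form "Passage N:"; check both against the actual data before relying on it. The helper names `split_passages` and `guess_retrieval_indices` are hypothetical.

```python
import re

# Assumption: each passage in `context` starts with a header line like "Passage 1:".
PASSAGE_HEADER = re.compile(r"^Passage \d+:[ \t]*$", flags=re.MULTILINE)


def split_passages(context):
    """Split a concatenated context into passage bodies, in order."""
    parts = PASSAGE_HEADER.split(context)
    # Anything before the first header is not a passage, so drop it.
    return [part.strip() for part in parts[1:]]


def guess_retrieval_indices(context, answers):
    """Return 0-based indices of passages containing any answer string.

    This is a case-insensitive substring heuristic, not a ground-truth label.
    """
    passages = split_passages(context)
    needles = [a.lower() for a in answers if a.strip()]
    return [
        i
        for i, passage in enumerate(passages)
        if any(needle in passage.lower() for needle in needles)
    ]


# Toy sample in the assumed format (not real benchmark data).
example = {
    "context": (
        "Passage 1:\nParis is the capital of France.\n"
        "Passage 2:\nBerlin is the capital of Germany.\n"
    ),
    "answers": ["Berlin"],
}
print(guess_retrieval_indices(example["context"], example["answers"]))  # prints [1]
```

Note that this only finds passages that literally contain the answer text. For multi-hop questions, intermediate supporting passages that never mention the final answer would be missed, and passages that mention it incidentally would be included, so a real 'retrieval_indices' field from the dataset authors would still be more reliable.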