IndoNLP / nusa-crowd

A collaborative project to collect datasets in Indonesian languages.
Apache License 2.0
260 stars 61 forks source link

Create dataset loader for ELI5_ID #370

Open SamuelCahyawijaya opened 7 months ago

SamuelCahyawijaya commented 7 months ago

NusaCatalogue: https://indonlp.github.io/nusa-catalogue/card.html?eli5_id

Dataset eli5_id
Description ELI5 is a dataset for long-form question answering. It contains 270K complex, diverse questions that require explanatory multi-sentence answers. Web search results are used as evidence documents to answer each question. This one is translated version in Indonesia
License Unknown