neulab/ExplainaBoard
Interpretable Evaluation for AI Systems
MIT License
Support more QA tasks? #54 (Open)
pfliu-nlp opened 2 years ago
pfliu-nlp commented 2 years ago
[ ] NQ (https://ai.google.com/research/NaturalQuestions/dataset)
[ ] TriviaQA (https://nlp.cs.washington.edu/triviaqa/) datasets
[ ] HotpotQA
[ ] DROP
divija96 commented 2 years ago
Listing other popular QA datasets here for reference:
[ ] WebQA: Multimodal, so it might be difficult to evaluate.
[ ] BioASQ: Biomedical semantic indexing.