BEIR cqadupstack - Githubissues

jobergum commented 1 year ago

I'm wondering about the BEIR https://ir-datasets.com/beir.html#beir/cqadupstack/* versus https://github.com/beir-cellar/beir dataset? Is there one ir dataset that has all the docs, queries, and qrels for cqadupstack ?

seanmacavaney commented 1 year ago

Hi @jobergum -- cqadupstack works differently than the other datasets in BEIR. Their convention for the dataset is to evaluate on each of the domains independently and average the nDCG@10 score for the final cqadupstack score.

From an ir-datasets perspective, that's 12 separate datasets, each with its own corpus, queries, and qrels. A concatenated cqadupstack dataset might be useful for some experiments, but it doesn't align with their task definition. As such, I'd be hesitant to add it, since it could be confused with BEIR's official way of reporting cqadupstack.

There are more details in this issue: https://github.com/beir-cellar/beir/issues/9#issuecomment-842147129

Hope this helps!

jobergum commented 1 year ago

Wow, thanks for the quick reply @seanmacavaney! I did not realize that it was 12 datasets, then I think it makes a lot of sense. Closing this!

allenai / ir_datasets

BEIR cqadupstack #221