How do I get the context in AmbigNQ? I downloaded the extra resources, but the ids in AmbigNQ do not correspond to the ids in the sqlite3 DB. I also downloaded the original NQ data, and although the example ids do correspond, the AmbigNQ data appears to have been processed into plain text while the original NQ data has not. This raises the question of how to match the two, and whether the start and end positions of the answer need to be provided.
Hi @HuaYZhao, thanks for your interest.
First, since AmbigNQ is an open-domain QA task, there is no provided context. The only available supervision is the answer text; there are no ground-truth start and end positions.
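(For illustration: because only answer strings are available, open-domain readers are typically trained with distantly supervised spans obtained by string-matching the answer inside retrieved passages. A minimal sketch; the helper name and whitespace tokenization are mine, not from the AmbigQA codebase.)

```python
def find_answer_spans(passage_tokens, answer_tokens):
    """Return all (start, end) token positions where the answer
    appears verbatim in the passage (distant supervision)."""
    spans = []
    n, m = len(passage_tokens), len(answer_tokens)
    for start in range(n - m + 1):
        if passage_tokens[start:start + m] == answer_tokens:
            spans.append((start, start + m - 1))  # inclusive end index
    return spans

# Example: matching the answer "barack obama" against a passage
passage = "barack obama was the 44th president of the united states".split()
answer = "barack obama".split()
print(find_answer_spans(passage, answer))  # [(0, 1)]
```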
Therefore, you are correct that the ids in AmbigNQ and the ids in the Wikipedia DB do not match each other.
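As a side note, if your Wikipedia DB follows the DrQA-style layout (a single `documents` table with `id` and `text` columns; that schema is an assumption here, so inspect it first), you can look up passages by the DB's own ids rather than AmbigNQ example ids:

```python
import sqlite3

# The path is hypothetical; point it at the downloaded Wikipedia DB.
conn = sqlite3.connect("path/to/wikipedia.db")
cursor = conn.cursor()

# Inspect the actual schema first, since table/column names may differ.
cursor.execute("SELECT name FROM sqlite_master WHERE type='table'")
print(cursor.fetchall())

# Assuming a DrQA-style `documents(id, text)` table:
cursor.execute("SELECT text FROM documents WHERE id = ?", ("Barack Obama",))
row = cursor.fetchone()
print(row[0][:200] if row else "id not found")
conn.close()
```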
When experimenting with baselines in the paper, we use Dense Passage Retrieval to retrieve related passages to feed into the reader model. Note, however, that this retrieval step is part of the model, rather than part of the data.
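For a rough picture of that retrieval step, here is a minimal sketch using the Hugging Face `transformers` port of DPR. The baselines in the paper use the original DPR codebase, so the checkpoint names and code below are illustrative rather than the exact setup:

```python
import torch
from transformers import (DPRQuestionEncoder, DPRQuestionEncoderTokenizer,
                          DPRContextEncoder, DPRContextEncoderTokenizer)

q_tok = DPRQuestionEncoderTokenizer.from_pretrained("facebook/dpr-question_encoder-single-nq-base")
q_enc = DPRQuestionEncoder.from_pretrained("facebook/dpr-question_encoder-single-nq-base")
c_tok = DPRContextEncoderTokenizer.from_pretrained("facebook/dpr-ctx_encoder-single-nq-base")
c_enc = DPRContextEncoder.from_pretrained("facebook/dpr-ctx_encoder-single-nq-base")

question = "who is the president of the united states"
passages = ["Barack Obama served as the 44th president ...",
            "The Eiffel Tower is located in Paris ..."]

with torch.no_grad():
    q_emb = q_enc(**q_tok(question, return_tensors="pt")).pooler_output
    c_emb = c_enc(**c_tok(passages, return_tensors="pt",
                          padding=True, truncation=True)).pooler_output

# Rank passages by inner product with the question embedding,
# then feed the top-scoring ones to the reader.
scores = (q_emb @ c_emb.T).squeeze(0)
print(passages[scores.argmax().item()])
```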