awslabs / rag-qa-arena

Apache License 2.0
21 stars 2 forks source link

Different docu id #5

Open Yeeebye opened 1 week ago

Yeeebye commented 1 week ago

The document id in annotation.._with citation.json seems to be different with the document id in ..from_colbert_test.json

Any reason to that, or how should I find the documents that are used in the answers?

Yeeebye commented 1 week ago

For example, the gold_docu_id for the first question in lifestyle is 102494, but it's not in lifestyle_from_colbert_test.jsonl given by you, nor in the dev_from_colbert_test.jsonl I got from running the script.