Closed steven-channel closed 1 month ago
While running an evaluation on the MIRACL-Korean benchmark, I noticed that the qrels and corpus files contain duplicate IDs, which is causing errors. Is this handled anywhere?
It looks like a similar issue was reported in Anserini:
https://github.com/castorini/anserini/issues/720
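For anyone who wants to confirm the duplicates locally, here is a minimal sketch in plain Python. It assumes the corpus is JSONL with a `docid` field and that the qrels are whitespace-separated TREC-style lines (`query_id Q0 doc_id relevance`); both are assumptions about the file layout, and the helper names (`duplicate_ids`, `corpus_ids`, `qrels_pairs`) and the sample lines are made up for illustration, not taken from any library or from the real data.

```python
import json
from collections import Counter


def duplicate_ids(ids):
    """Return {id: count} for every ID that appears more than once."""
    counts = Counter(ids)
    return {key: n for key, n in counts.items() if n > 1}


def corpus_ids(lines, id_field="docid"):
    """Yield the document ID from each non-empty JSONL line.

    The field name "docid" is an assumption; adjust it to the actual file.
    """
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)[id_field]


def qrels_pairs(lines):
    """Yield (query_id, doc_id) from TREC-style qrels lines.

    Assumes four whitespace-separated columns: query_id, iteration,
    doc_id, relevance. Shorter lines are skipped.
    """
    for line in lines:
        parts = line.split()
        if len(parts) >= 4:
            yield parts[0], parts[2]


# Made-up sample data standing in for the real files.
sample_corpus = [
    '{"docid": "1#0", "text": "a"}',
    '{"docid": "1#1", "text": "b"}',
    '{"docid": "1#0", "text": "c"}',
]
sample_qrels = [
    "q1 Q0 1#0 1",
    "q1 Q0 1#1 0",
    "q1 Q0 1#0 1",
]

print(duplicate_ids(corpus_ids(sample_corpus)))
print(duplicate_ids(qrels_pairs(sample_qrels)))
```

To run it against the real files, pass an open file handle instead of the sample lists, e.g. `duplicate_ids(corpus_ids(open(path, encoding="utf-8")))`, where `path` points at your local copy. Whether to keep the first or the last occurrence of a duplicate is a separate decision that this sketch does not make.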