Hi, I am replicating your code and I don't know how to get candidata_file.
I followed your instructions:
2.Prepare the dataset for multiprocessing:
Generate the validation sets (BM25 results from Anserini) via matchmaker/preprocessing/generate_validation_input_from_candidate_set.py
And I found this project castorini/anserini to get BM25 results for MS MARCO Passage Ranking. But it just shows the ranked doc_id of query rather than the result like '2 Q0 1782337 1 21.656799 Anserini' which is from matchmaker/preprocessing/generate_validation_input_from_candidate_set.py file.
So I wanna ask for your help about how I should get candidate_file. I will appreciate it if you could provide some more detailed guidance. Thanks a lot~
Hi, I am replicating your code and I don't know how to get candidata_file. I followed your instructions:
And I found this project castorini/anserini to get BM25 results for MS MARCO Passage Ranking. But it just shows the ranked doc_id of query rather than the result like '2 Q0 1782337 1 21.656799 Anserini' which is from
matchmaker/preprocessing/generate_validation_input_from_candidate_set.py
file.So I wanna ask for your help about how I should get candidate_file. I will appreciate it if you could provide some more detailed guidance. Thanks a lot~