Closed abbysticha closed 1 year ago
Hi @abbysticha, thank you for your question.
-nq
models we did use the Facebook DPR models as mentioned here -- https://github.com/facebookresearch/DPR#new-march-2021-retrieval-model. I'm not sure if the huggingface models are the same so can't comment on that. But I would use their script to fetch the models -- https://github.com/facebookresearch/DPR/blob/main/dpr/data/download_data.pyn_docs=5
and during evaluation we used n_docs=10
Hope this helps!
Hello, thank you very much for making this baseline code available. I have tried to reproduce the results for the -nq and -ft models from the paper for Task I and Task II. I am getting lower results for F1, EM, and BLEU, especially for the -nq models. I have been reviewing the code to find any bugs that I may have in my own implementation and came across two questions:
Thank you again for your help!