Closed sahel-sh closed 5 months ago
@jasper-xian given your previous cl about pyserini_retriever, you are a good candidate for this change which would be much simpler. Do you have the bandwidth to work on this?
yup I can take this
it is yours, thank you!
Thank you @jasper-xian for working on this!
Currently, retriever.from_.. for dataset and custom index does not take k as a parameter, they should be able to take it and pass it down to pyserini. The candidate file names used for storing and reusing the retrieved_results should also have this parameter included, so that retrieving top 20 does not create a false collision with retriving top 100 by having the same file name.