Closed Barianc closed 6 months ago
Hi, thanks for your interest! Your understanding is correct, in our implementation, we use a fixed hop specialized by the dataset to submit the highest scores to the leaderboard. As for the method mentioned in the paper, it will cause a performance decrease of approximately 5%. So you can also use a predefined threshold in your scene if the threshold is appropriate.
@canghongjian I also have some follow-up questions about this issue. Thank you for every response to issues in this repo.
@yc-song All the performance reported in the paper is based on a predefined hop, which is the same as the code.
Hello, thank you very much for your work. However, I have the following questions: In your paper, you mention
So, according to my understanding, the beam retriever will stop automatically at the appropriate step (hop). Yet, you set a fixed
hop
for each dataset, such ashop = 2
in HotpotQA. Why is that? This framework should be adaptable to variable hops. Or am I misunderstanding your paper or code?I hope you can clarify my confusion, thanks again!