For NQ, it seems in your self-mined hard negatives training set hn.json, there are 70076 queries. But in the original training set downloaded from DPR (biencoder-nq-train.json), there are only 58880 queries. Can I ask where these extra queries are from?
Hi :),
For NQ, it seems in your self-mined hard negatives training set
hn.json
, there are 70076 queries. But in the original training set downloaded from DPR (biencoder-nq-train.json), there are only 58880 queries. Can I ask where these extra queries are from?Thanks in advance.