Open shzamanirad opened 3 years ago
@shzamanirad: The training data for the cross-encoder is produced by the eval_biencoder.py script. For every data point in the train/test/valid splits, the eval script outputs the top 64 retrieved candidates and calculates recall at various cutoffs (recall@1, @10, @64, etc.).
The top 64 retrieved candidates are then used by the cross-encoder as its train/test/valid data.
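To make the two steps above concrete, here is a minimal sketch of computing recall@k over ranked candidates and turning the top-k lists into cross-encoder examples. It is not the code from eval_biencoder.py: the function names `recall_at_k` and `build_crossencoder_examples`, the dictionary layout, and the choice to skip data points whose gold entity is missing from the top-k list are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def recall_at_k(
    ranked_candidates: Sequence[Sequence[int]],
    gold_ids: Sequence[int],
    ks: Sequence[int] = (1, 10, 64),
) -> Dict[int, float]:
    """Fraction of data points whose gold id appears in the top k candidates."""
    if len(ranked_candidates) != len(gold_ids):
        raise ValueError("ranked_candidates and gold_ids must have the same length")
    hits = {k: 0 for k in ks}
    for candidates, gold in zip(ranked_candidates, gold_ids):
        for k in ks:
            if gold in candidates[:k]:
                hits[k] += 1
    n = len(gold_ids)
    return {k: (hits[k] / n if n else 0.0) for k in ks}


def build_crossencoder_examples(
    ranked_candidates: Sequence[Sequence[int]],
    gold_ids: Sequence[int],
    top_k: int = 64,
) -> List[dict]:
    """Keep the top_k candidates per data point and record the gold position.

    Assumption (not confirmed by the thread): data points whose gold id is
    not among the top_k candidates are skipped, since they have no valid label.
    """
    examples = []
    for candidates, gold in zip(ranked_candidates, gold_ids):
        top = list(candidates[:top_k])
        if gold not in top:
            continue
        examples.append({"candidates": top, "label": top.index(gold)})
    return examples
```

With this layout, `recall_at_k` reports how often the bi-encoder retrieves the gold entity at each cutoff, and `build_crossencoder_examples` produces one candidate list plus label index per retained data point.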
Hi, I have a question: if we train the cross-encoder model with 64 candidates, can we evaluate the model with 20 or 30 candidates?
Hi,
It is mentioned in the paper that:
Can you please provide this training data for cross-encoder?
Thanks.