Training Data for Page-level Document Retrieval

facebookresearch / GENRE

Autoregressive Entity Retrieval

Other

765 stars 103 forks source link

Training Data for Page-level Document Retrieval #75

Closed Chriskuei closed 2 years ago

Chriskuei commented 2 years ago

Hi, thanks for the great work!

In the paper, it says that GENRE is trained on BLINK and all KILT data simultaneously. As we know, BLINK data and other data are not of the same order of magnitude. Are any strategies applied to balance the data? Or just mix all the data together for training?

nicola-decao commented 2 years ago

Hi, no I did not apply anything, I just mixed the data.

Chriskuei commented 2 years ago

Thanks for your quick reply!

Do you have GENRE performance for page-level retrieval on KILT dev data?