sebastian-hofstaetter / matchmaker

Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
https://neural-ir-explorer.ec.tuwien.ac.at/
Apache License 2.0
261 stars, 30 forks

Further training ColBERT on a domain-specific dataset #19

Open ashutosh-dwivedi-e3502 opened 2 years ago

ashutosh-dwivedi-e3502 commented 2 years ago

How would you suggest further training the pre-trained ColBERT model from this repo on our domain-specific dataset? Currently, the repo only supports training the model from scratch on the MS MARCO dataset.

Which loss function should be used for this, and what should the config look like?