Hey, this is awesome work!
Are there any tips, scripts or items I should be looking at for training on a separate corpus?
Or similarly, any documented methods for adding additional material into the model?
scratch that - looks like just go ahead and utilize pyserini with or without dense indexes based on desired behavior - is that a correct read?
And then utilize DPR as appropriate for bidirectional encodings?
Hey, this is awesome work!
Are there any tips, scripts or items I should be looking at for training on a separate corpus? Or similarly, any documented methods for adding additional material into the model?