ScottishFold007 opened this issue 1 month ago
Hello!
Thanks for the suggestion. I think it really depends on how elaborate the training approach is. The original code wasn't released, so it's a bit hard to tell. The model also does a few other tricks during training (e.g. false negative filtering, clustering of the training data) that make it stronger than it would be otherwise. At the moment I'm thinking that implementing the full recipe might not make sense, although some of the components could be useful on their own, e.g. a batch sampler that clusters the training data using some SentenceTransformer model (perhaps a StaticEmbedding-based one like tomaarsen/static-bert-uncased-gooaq).
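For illustration, here's a minimal sketch of what such a clustering-based batch sampler could look like. The class name, its parameters, and the cluster-sizing heuristic are all hypothetical; only the tomaarsen/static-bert-uncased-gooaq model name comes from the comment above, and this is not the CDE authors' actual procedure:

```python
import numpy as np
from sklearn.cluster import KMeans
from sentence_transformers import SentenceTransformer
from torch.utils.data import Sampler


class ClusteredBatchSampler(Sampler):
    """Yields batches whose examples come from the same embedding cluster,
    so in-batch negatives are semantically closer (i.e. harder)."""

    def __init__(self, texts, batch_size, embedder_name="tomaarsen/static-bert-uncased-gooaq"):
        self.batch_size = batch_size
        # Embed all training texts once with a cheap static-embedding model
        embedder = SentenceTransformer(embedder_name)
        embeddings = embedder.encode(texts, batch_size=256)
        # Heuristic: size clusters so each one holds roughly a few batches
        n_clusters = max(1, len(texts) // (batch_size * 4))
        labels = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(embeddings)
        self.clusters = [np.where(labels == c)[0] for c in range(n_clusters)]

    def __iter__(self):
        batches = []
        for cluster in self.clusters:
            idx = np.random.permutation(cluster)
            # Drop the ragged tail so every batch is full and single-cluster
            for start in range(0, len(idx) - self.batch_size + 1, self.batch_size):
                batches.append(idx[start : start + self.batch_size].tolist())
        np.random.shuffle(batches)
        yield from batches

    def __len__(self):
        return sum(len(c) // self.batch_size for c in self.clusters)
```

Something like this could then be passed as `batch_sampler=` to a `torch.utils.data.DataLoader`, so that every batch is drawn from a single cluster.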
The CDE model is incredibly powerful, as it naturally integrates "context tokens" into the embedding process. As of October 1, 2024, cde-small-v1 stands as the top-performing small model (under 400M parameters) on the MTEB leaderboard for text embedding models, with an average score of 65.00. Have you considered implementing its training in sentence-transformers? I'm really looking forward to it!!!
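For context, inference with cde-small-v1 is already a two-stage process: context tokens are built from a corpus sample, then documents and queries are embedded conditioned on them. Below is a sketch of that flow; the `dataset_embeddings` keyword and the `"document"`/`"query"` prompt names come from the model's custom remote code as described on its model card, not from the core sentence-transformers API, so treat the exact signature as an assumption:

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("jxm/cde-small-v1", trust_remote_code=True)

corpus = [
    "CDE integrates context tokens from the corpus into each embedding.",
    "Static embedding models trade accuracy for very fast inference.",
]
queries = ["What makes CDE embeddings contextual?"]

# Stage 1: embed a corpus sample to build the "context tokens".
# Per the model card, this sample should match
# model[0].config.transductive_corpus_size; a tiny corpus is used
# here purely for illustration.
dataset_embeddings = model.encode(
    corpus, prompt_name="document", convert_to_tensor=True
)

# Stage 2: embed documents and queries conditioned on that context.
doc_embeddings = model.encode(
    corpus,
    prompt_name="document",
    dataset_embeddings=dataset_embeddings,
    convert_to_tensor=True,
)
query_embeddings = model.encode(
    queries,
    prompt_name="query",
    dataset_embeddings=dataset_embeddings,
    convert_to_tensor=True,
)

print(model.similarity(query_embeddings, doc_embeddings))
```

The training side would presumably need to reproduce this conditioning in the loss, which is part of what makes the implementation question non-trivial.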