huggingface / setfit

Efficient few-shot learning with Sentence Transformers
https://hf.co/docs/setfit
Apache License 2.0

Pre-training a generic SentenceTransformer for domain adaptation #222

Open · zachschillaci27 opened this issue 1 year ago

zachschillaci27 commented 1 year ago

When using SetFit for classification in a more technical domain, I imagine the generically trained SBERT models may produce poor sentence embeddings if the domain is not well represented in their training corpus. In that case, would it be advantageous to first apply domain adaptation techniques (as discussed here) to an SBERT model before using it as the base model in SetFit? Have you considered and/or tested such an approach?
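For concreteness, something like the following is what I have in mind: a minimal sketch of TSDAE-style unsupervised domain adaptation with sentence-transformers, where `domain_sentences` is a placeholder for an unlabeled in-domain corpus, and one could equally start from an existing SBERT checkpoint rather than `bert-base-uncased`:

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, datasets, losses, models

# Placeholder: an unlabeled corpus of in-domain sentences
domain_sentences = ["..."]

# Build an SBERT-style encoder from a plain transformer checkpoint
model_name = "bert-base-uncased"
word_embedding_model = models.Transformer(model_name)
pooling_model = models.Pooling(
    word_embedding_model.get_word_embedding_dimension(), "cls"
)
model = SentenceTransformer(modules=[word_embedding_model, pooling_model])

# TSDAE: reconstruct sentences from noisy (token-deleted) inputs
train_dataset = datasets.DenoisingAutoEncoderDataset(domain_sentences)
train_dataloader = DataLoader(train_dataset, batch_size=8, shuffle=True)
train_loss = losses.DenoisingAutoEncoderLoss(
    model, decoder_name_or_path=model_name, tie_encoder_decoder=True
)

model.fit(
    train_objectives=[(train_dataloader, train_loss)],
    epochs=1,
    scheduler="constantlr",
    optimizer_params={"lr": 3e-5},
    weight_decay=0,
    show_progress_bar=True,
)
model.save("output/tsdae-domain-adapted")
```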

Thanks for the help!

leoplusx commented 1 year ago

Bumping this. I have the same question.

I'd think this could improve accuracy on classification tasks as well.

First, domain-adapt the model with unsupervised data. Then fit it to the specific task with labeled data, in a very sample-efficient way thanks to SetFit.

Wouldn't this yield even better results for all tasks?
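Concretely, the second stage could then be as simple as pointing SetFit at the adapted checkpoint. A rough sketch, where the checkpoint path and the tiny labeled dataset are placeholders:

```python
from datasets import Dataset
from setfit import SetFitModel, SetFitTrainer

# Load the domain-adapted encoder as the SetFit body;
# the classification head is initialized fresh
model = SetFitModel.from_pretrained("output/tsdae-domain-adapted")

# Placeholder: a handful of labeled in-domain examples
train_ds = Dataset.from_dict({
    "text": ["an in-domain example", "another in-domain example"],
    "label": [0, 1],
})

trainer = SetFitTrainer(model=model, train_dataset=train_ds)
trainer.train()

preds = model.predict(["a new in-domain sentence"])
```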

Thanks.