UKPLab / sentence-transformers

State-of-the-Art Text Embeddings
https://www.sbert.net
Apache License 2.0

Domain-specific fine-tuning + OOV words #835

fcggamou closed 3 years ago

fcggamou commented 3 years ago

Hi,

I'm training a SentenceTransformer on top of an existing Spanish MLM model (bert-base-spanish-wwm-uncased), using a smallish labelled dataset. So far it works pretty well, but I'm trying to push it a bit further.

I'm hoping you can give me some pointers on two things:

Thanks!

nreimers commented 3 years ago

Hi,

fcggamou commented 3 years ago

Thanks a lot for your answer!