UKPLab / sentence-transformers

State-of-the-Art Text Embeddings
https://www.sbert.net
Apache License 2.0
14.83k stars 2.44k forks source link

Hyperparameters to train stsb-mpnet-base-v2 #1136

Closed XuandongZhao closed 3 years ago

XuandongZhao commented 3 years ago

I am wondering how can I reproduce the pre-trained models. For example, I think "stsb-mpnet-base-v2" is trained on "ALLNLI" and "STSb" dataset with "MultipleNegativesRankingLoss". Is it right?

Also could you please provide some hyperparameters or scripts you use to train the "stsb-mpnet-base-v2" models, such as "epochs", "learning rate", "batch_size"...

Thank you very much for your help!

nreimers commented 3 years ago

Here the two scripts https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/nli/training_nli_v2.py https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/sts/training_stsbenchmark_continue_training.py

XuandongZhao commented 3 years ago

Here the two scripts https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/nli/training_nli_v2.py https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/sts/training_stsbenchmark_continue_training.py

Thanks, I already read these two scripts. In these two examples, we can see parameters for "distilroberta". Are the hyperparameters all the same for different models?

nreimers commented 3 years ago

Yes

XuandongZhao commented 3 years ago

Yes

Thanks a lot! BTW, could I ask what type of GPU do you use? Is it tesla V100?

nreimers commented 3 years ago

same for me