UKPLab / sentence-transformers

State-of-the-Art Text Embeddings
https://www.sbert.net
Apache License 2.0
15.22k stars 2.47k forks source link

Paraphrase Models Data #916

Open hrsmanian opened 3 years ago

hrsmanian commented 3 years ago

Hi, We are using paraphrase-distilroberta-base-v1 for paraphrase identification. Thanks for the work. It gives good results on many tasks. Can you kindly share or point me to the source for the data? We would like to train on a smaller model like bert-small etc. It mentions that it was trained on millions of examples.

Thanks in advance Ram

nreimers commented 3 years ago

Have a look here: https://github.com/UKPLab/sentence-transformers/tree/master/examples/training/paraphrases

Currently this is work in progress

hrsmanian commented 3 years ago

Thanks a lot. This is really useful information