PAWS dataset negative examples

Vamsi995 / Paraphrase-Generator

A paraphrase generator built using the T5 model which produces paraphrased English sentences.

MIT License

310 stars 66 forks source link

Hi Vamsi,

I would be interested how did you deal with the negative paraphrase examples from the PAWS dataset, e.g. "Although interchangeable, the body pieces on the 2 cars are not similar." vs. "Although similar, the body parts are not interchangeable on the 2 cars." (which are not paraphrases as described here: https://github.com/google-research-datasets/paws ). Are they 1) part of the model training, 2) or are they simply ignored during training, 3) or did you adjust the loss function to increase the loss for negative examples?

I would like to use your model as a baseline in my master thesis. Is there anything I can cite?

Best Johannes

Vamsi995 / Paraphrase-Generator

PAWS dataset negative examples #20