Paraphrasing / Question Rewriting - Githubissues

Glavin001 / Data2AITextbook

🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)

MIT License

25 stars 2 forks source link

Paraphrasing / Question Rewriting #4

Open Glavin001 opened 1 year ago

Glavin001 commented 1 year ago

Paraphrasing

Question Rewriting

Glavin001 commented 1 year ago

https://github.com/google-research-datasets/paws

This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification.

Glavin001 commented 1 year ago

https://github.com/Vamsi995/Paraphrase-Generator

A paraphrase generator built using the T5 model which produces paraphrased English sentences.
https://huggingface.co/humarin/chatgpt_paraphraser_on_T5_base
https://huggingface.co/datasets/humarin/chatgpt-paraphrases
https://pub.towardsai.net/how-to-do-effective-paraphrasing-using-huggingface-and-diverse-beam-search-t5-pegasus-229ca998d229
https://huggingface.co/eugenesiow/bart-paraphrase

Evaluation