Which datasets are used?

ramsrigouthamg / Questgen.ai

Question generation using state-of-the-art Natural Language Processing algorithms

https://questgen.ai/

MIT License

910 stars 294 forks source link

Closed thomas-chauvet closed 4 years ago

thomas-chauvet commented 4 years ago

Hello,

This project is really interesting!

Could you share which datasets are used for training? I didn't find it in the code.

Thanks in advance!

ramsrigouthamg commented 4 years ago

Mainly Quora Question pairs, BoolQ, Squad and MSMarco are the datasets used

thomas-chauvet commented 4 years ago

Thank you for your answer!