Open koaning opened 4 years ago
It might be good to integrate with [ml-datasets from the spaCy team](https://github.com/explosion/ml-datasets).
The IMDB dataset seems useful.
https://github.com/PolyAI-LDN/conversational-datasets/tree/master/opensubtitles
https://huggingface.co/nlp/viewer/?dataset=cornell_movie_dialog
https://huggingface.co/nlp/viewer/?dataset=rotten_tomatoes
https://huggingface.co/nlp/viewer/?dataset=daily_dialog
https://github.com/RaRe-Technologies/gensim-data
https://github.com/facebookresearch/SentEval
http://ixa2.si.ehu.es/stswiki/index.php/STSbenchmark
https://www.aclweb.org/anthology/2020.lrec-1.186.pdf
https://github.com/BSU-CAST/KidSpell
https://www.aclweb.org/anthology/2020.lrec-1.586.pdf