koaning / inlay

a bit of fun embedding in non-standard ways
0 stars 0 forks source link

datasets integration #2

Open koaning opened 4 years ago

koaning commented 4 years ago

It might be good to integrate with [https://github.com/explosion/ml-datasets](ml-datasets from the spaCy team).

The IMDB dataset seems useful.

koaning commented 4 years ago

https://github.com/PolyAI-LDN/conversational-datasets/tree/master/opensubtitles

koaning commented 4 years ago

https://huggingface.co/nlp/viewer/?dataset=cornell_movie_dialog

koaning commented 4 years ago

https://huggingface.co/nlp/viewer/?dataset=rotten_tomatoes

koaning commented 4 years ago

https://huggingface.co/nlp/viewer/?dataset=daily_dialog

koaning commented 4 years ago

https://github.com/RaRe-Technologies/gensim-data

koaning commented 4 years ago

https://github.com/facebookresearch/SentEval

koaning commented 4 years ago

http://ixa2.si.ehu.es/stswiki/index.php/STSbenchmark

koaning commented 4 years ago

https://www.aclweb.org/anthology/2020.lrec-1.186.pdf

koaning commented 4 years ago

https://github.com/BSU-CAST/KidSpell

koaning commented 4 years ago

https://www.aclweb.org/anthology/2020.lrec-1.586.pdf