JuliaText / CorpusLoaders.jl

A variety of loaders for various NLP corpora.

Add some sentiment datasets #13

Closed: oxinabox closed this issue 5 years ago

oxinabox commented 6 years ago

Maybe http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/

Though that would probably require porting a Twitter tokenizer into WordTokenizers.jl; see https://github.com/JuliaText/WordTokenizers.jl/issues/3

ComputerMaestro commented 5 years ago

@oxinabox, the site for this Twitter dataset is no longer loading. It was working when I started writing the code, but it now seems to be unavailable. So I am using the dataset linked from this page instead: http://help.sentiment140.com/for-students

The Stanford download link is https://cs.stanford.edu/people/alecmgo/trainingandtestdata.zip