cyhex / streamcrab

Real-Time, Twitter sentiment analyzer engine
http:/www.streamcrab.com
144 stars 49 forks source link

other language #15

Closed ilnurmanapov closed 10 years ago

ilnurmanapov commented 10 years ago

Timor, how to add russian language in setting for analysis of the russian tweets?

cyhex commented 10 years ago

streamcrab was develop for English only. However you can adjust to use Russian (or any other language) by implementing the smm.classifier.textprocessing.TwitterProcessor using Russian stemmer and Russian stopwords.

You would also need to an adjust smm.datastream.plugins.twitterworker.TwitterWorker and null.TwitterCollector to filter for Russian language.

Hope it helps, if you do succeed please contribute back :)