WladimirSidorenko / PotTS

The Potsdam Twitter Sentiment Corpus
MIT License

Automatically extract text & sentiment? #3

Open AngledLuffa opened 4 years ago

AngledLuffa commented 4 years ago

Is there a convenient command-line tool or a sequence of MMAX calls that will extract the spans and their sentiment polarity? The best I have figured out so far is to extract one file at a time, which is tedious.

WladimirSidorenko commented 4 years ago

Could you please provide an example of the expected output format?

AngledLuffa commented 4 years ago

The ideal end result would be a tab-separated file with one `sentiment<TAB>text` pair per line. Naturally, any format that can be easily converted to that would be fine as well. I tried looking at MMAX, but unless I missed something, extracting that information with it would be tedious. Any script or suggestion on how to do this would be helpful.

Thanks!
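
To make the requested output concrete, here is a rough sketch of what such an extraction could look like in Python. It is not a script from this repository. It assumes the usual MMAX2-style layout: a words file with `<word id="word_N">` elements and a markables file whose `<markable>` elements carry a `span` attribute such as `word_1..word_3`. The attribute name `polarity`, the namespace in the sample, and the helper names `expand_span`, `extract_rows`, and `to_tsv` are all assumptions made for illustration and would need to be checked against the actual corpus files.

```python
import xml.etree.ElementTree as ET


def expand_span(span):
    """Expand an MMAX2-style span like 'word_1..word_3,word_5' into word ids."""
    ids = []
    for part in span.split(","):
        if ".." in part:
            start, end = part.split("..")
            prefix, first = start.rsplit("_", 1)
            last = end.rsplit("_", 1)[1]
            ids.extend(f"{prefix}_{i}" for i in range(int(first), int(last) + 1))
        else:
            ids.append(part)
    return ids


def extract_rows(words_xml, markables_xml, attr="polarity"):
    """Return (label, text) pairs; `attr` is an ASSUMED attribute name."""
    words = {w.get("id"): (w.text or "")
             for w in ET.fromstring(words_xml).iter("word")}
    rows = []
    for m in ET.fromstring(markables_xml).iter():
        # Markable files are usually namespaced, so compare the local tag name.
        if m.tag.rsplit("}", 1)[-1] != "markable" or m.get(attr) is None:
            continue
        tokens = [words[i] for i in expand_span(m.get("span")) if i in words]
        rows.append((m.get(attr), " ".join(tokens)))
    return rows


def to_tsv(rows):
    """Serialize (label, text) pairs as 'label<TAB>text' lines."""
    return "".join(f"{label}\t{text}\n" for label, text in rows)


# Tiny made-up sample standing in for one words file and one markables file.
WORDS = ('<words><word id="word_1">Tolles</word>'
         '<word id="word_2">Spiel</word><word id="word_3">!</word></words>')
MARKABLES = ('<markables xmlns="www.eml.org/NameSpaces/sentiment">'
             '<markable id="markable_1" span="word_1..word_2" '
             'polarity="positive"/></markables>')

if __name__ == "__main__":
    print(to_tsv(extract_rows(WORDS, MARKABLES)), end="")
```

Running the same functions over every words/markables file pair in a loop and concatenating the results would avoid the one-file-at-a-time workflow. The sketch does not escape tabs inside the tweet text, so that would need handling if they occur.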