JuliaText / TextAnalysis.jl

Julia package for text analysis
Other
374 stars 96 forks source link

Better sentiment analysis model #84

Open ayush1999 opened 6 years ago

ayush1999 commented 6 years ago

The current sentiment analysis model isn't very good, and needs to be changed (as discussed with @aviks ). Also, following the discussion in #83 , it'd be better to warn the user before skipping works not in vocabulary.

aviks commented 5 years ago

Benchmarking our existing model against standard datasets would be good place to start.

ksteimel commented 5 years ago

Sentiment analysis is a task that can be hard to transfer across different domains. Positive movie reviews may use different terminology when compared to positive video game reviews.

Should the emphasis be on making a generalizable model or making a decent model that you can easily augment with your own data in the target domain?

aviks commented 5 years ago

The latter, I would think. In any case, getting a good baseline+testcases would help in either case.

ksteimel commented 5 years ago

I have a repository where I benchmarked the current system.

The performance is very poor at 52% accuracy (only slightly above random chance).

This may be related to issue #129

Shashi456 commented 5 years ago

@aviks In issue #129, you said the proposed change breaks the test cases, would you suggest fixing the model or writing a new one from scratch?