Closed maytepenella closed 6 years ago
Added function get_vocabulary_per_sentiment to train.py
TO DO: discard words with low correlation?
TO DO: Upload version with neutral vocabulary
Modified version that considers all the senitments present in df.
Code has been redone to be more understandable.
For now we will not discard words based on correlation
Change to suit with main.py:
Fixed on last commit
Find out which words are more correlated to possitive and negative sentiment. Maybe we can use the matrix correlation to define best possitive and negative words (+ correlations are good and - correlations appear more in negative sentences)