Open maytepenella opened 6 years ago
Implemented in notebook "Machine Learning IIpy36.ipynb" from 11_SupervisedLearning classes
for code see my notebook
We have the following percentages of tweets:
We are relativelly good at classifying bad tweets, but neutral and also possitive tweets are degrading performance a lot. Maybe we should consider a first division among tweets with sentiment and tweets without sentiment (as described in the reference provided by Laia https://github.com/ayushoriginal/Sentiment-Analysis-Twitter )
Also we should analyze why we fail at possitive tweets.
The matrix is divided in four quarters and contains:
True Positives (TP): Positive samples predicted as such. True Negatives (TN): Negative samples predicted as such. False Positives (FP): Negative samples predicted as positive. False Negatives (FN): Positive samples predicted as negative.
Should we consider a 3 by 3 matrix? (possitive, neutral, negative)