Deeper understanding of Weka - Error Metrics

Sales-Choice-Volunteering-Project / EmotionAnalyzerWeka

The program for obtaining emotion data

1 stars 0 forks source link

Deeper understanding of Weka - Error Metrics #7

Open damirsaleschoice opened 3 years ago

damirsaleschoice commented 3 years ago

Open a non-nominal example train with any algorithm

You will see an output such as Correlation coefficient 0.937 Mean absolute error 30846523146.2126 Root mean squared error 38225020343.6509 Relative absolute error 32.4817 % Root relative squared error 34.8704 % Total Number of Instances 144

Learn what each one of them means, and what role they play in the error metrics.

What in your opinion would be the best error metrics ?
What in your opinion would be the error metrics displaying an over-trained model ?

sherlockliang888 commented 3 years ago

It depends on how we want to treat outliers I guess: if we want to ignore them then MAE is a better choice, but if we want to account for them then MSE/RMSE may be a better choice. I would prefer to use RMSE as it shows differences in both directions, and it is more commonly used (I think). I know that we can learn whether a model is over-trained or not from validation set, but I'm not too sure which metrics to look at for it...but I think when the number of instances is small, the model might be over-trained.

damirsaleschoice commented 3 years ago

Cool. I usually just look at the Correlation coefficient, and if it is too high (beyond 0.95) then there is probably some over-training involved... If too low, then it means that the model is not so good... only then I look at the other metrics.

I will close this ticket as DONE

sherlockliang888 commented 3 years ago

This is helpful, thanks.