Closed L-M-Sherlock closed 3 months ago
I am not an expert in statistics, but is this actually an improvement? When there is an uncertainty at the 2nd place of decimal in the RMSE, does it make sense to consider the 3rd and the 4th decimal places?
@Expertium, can you confirm?
We would need to run a statistical significance test. @L-M-Sherlock could you please run my logp_wilcox
(from significance_table.py) on the baseline values of RMSE and the new values?
Like this: log_p_value = logp_wilcox(baseline_RMSE, new_RMSE)[0]
Yep, that's definitely significant. Well, statistically, but not practically, since the effect is only about 0.5%
Weighted average by reviews:
Weighted average by log(reviews):
improved ~0.6% and ~0.5%.