Open rlee287 opened 6 years ago
Still trying to figure out statistical power, but it should be possible to model the gym results as a binomial process and use something like binom_cdf(result_win, num_trials, 1/num_bots) to calculate the p-value.
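A minimal sketch of the binomial idea, using only the Python standard library. It assumes each game has exactly one winner, that games are independent, and that under the null hypothesis every bot is equally likely to win (probability 1/num_bots). The function names here are hypothetical and not part of the gym code. Note that a plain CDF gives P(X <= wins); if the question is whether a bot wins more often than chance, the upper tail P(X >= wins) is probably the quantity wanted, so that is what this sketch computes.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def binom_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(binom_pmf(i, n, p) for i in range(k, n + 1))


def win_p_value(result_win, num_trials, num_bots):
    """One-sided p-value for 'this bot wins more often than chance'.

    Null hypothesis (an assumption of this sketch): the bot wins each
    game with probability 1 / num_bots.
    """
    if not 0 <= result_win <= num_trials:
        raise ValueError("result_win must be between 0 and num_trials")
    p_null = 1 / num_bots
    return binom_upper_tail(result_win, num_trials, p_null)
```

For example, `win_p_value(40, 100, 4)` asks how surprising 40 wins out of 100 four-bot games would be if the bot were no better than its opponents. A small value suggests the win rate is above chance; it does not by itself say which of two versions is better.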
Implement proper statistics to help decide which version(s) are better.
- p-value
- Confidence intervals
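For the confidence-interval item, one possible approach is a Wilson score interval on a bot's observed win rate. This is a sketch under assumptions, not a chosen design: the issue does not say which interval to use, `wilson_interval` is a hypothetical name, and the default `z=1.96` corresponds to an approximate 95% level.

```python
from math import sqrt


def wilson_interval(wins, trials, z=1.96):
    """Wilson score interval for a binomial proportion.

    wins / trials is the observed win rate; z is the normal quantile
    for the desired confidence level (1.96 is roughly 95%).
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= wins <= trials:
        raise ValueError("wins must be between 0 and trials")
    p_hat = wins / trials
    denom = 1 + z * z / trials
    centre = (p_hat + z * z / (2 * trials)) / denom
    half_width = (
        z * sqrt(p_hat * (1 - p_hat) / trials + z * z / (4 * trials * trials))
    ) / denom
    # Clamp to [0, 1] to guard against floating-point overshoot.
    return max(0.0, centre - half_width), min(1.0, centre + half_width)
```

If the intervals for two versions do not overlap, that is reasonable evidence that their win rates differ. Overlapping intervals are inconclusive and would usually call for more games.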