Open marco-c opened 9 years ago
100-round run ^
Without the tails it's not far off from a normal distribution. There are a few statisticians in metrics near my desk. If I get some time a little later, I'll discuss it with them.
Yeah, the more rounds we run, the better the central limit theorem applies (the magic number is often 30). Maybe we should draw the quantile-quantile plot after running the benchmarks, so that we avoid drawing wrong conclusions.
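For what it's worth, here is a rough sketch of how the Q-Q data could be computed after a run, using only the Python standard library. The function names (`qq_points`, `qq_correlation`) and the sample values are made up for illustration and are not part of the benchmark code. The correlation of the Q-Q points is only a quick heuristic (closer to 1 means closer to a straight line), not a formal normality test.

```python
from statistics import NormalDist, mean


def qq_points(samples):
    """Return (theoretical, observed) quantile pairs for a normal Q-Q plot."""
    n = len(samples)
    observed = sorted(samples)
    std_normal = NormalDist()  # standard normal: mean 0, standard deviation 1
    # Plotting positions (i - 0.5) / n stay strictly inside (0, 1),
    # so inv_cdf never has to evaluate the infinite quantiles at 0 or 1.
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(theoretical, observed))


def qq_correlation(samples):
    """Pearson correlation of the Q-Q points (heuristic straightness measure)."""
    points = qq_points(samples)
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in points)
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


# Hypothetical startup times in ms for 30 rounds: mostly well behaved,
# with the three slowest rounds replaced by large spikes.
well_behaved = [NormalDist(100, 5).inv_cdf((i - 0.5) / 30) for i in range(1, 31)]
with_spikes = well_behaved[:-3] + [200, 250, 300]

print("well behaved:", round(qq_correlation(well_behaved), 3))
print("with spikes: ", round(qq_correlation(with_spikes), 3))
```

The pairs returned by `qq_points` can be handed to any plotting tool. If the points bend away from a straight line at the ends, the tails are heavier (or lighter) than a normal distribution would produce.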
(I don't have enough knowledge about statistics to tell if the tails are a problem)
Memory spikes seem to correlate with startup time spikes.
The benchmark samples might not be normally distributed, which makes the Student's t-test results unreliable.
This is a Q-Q plot of one run of the benchmark (30 rounds). Obviously, the data is not normally distributed in this case.