Open h-vetinari opened 6 years ago
You can do a run with --record-samples
and then use asv show --details
to inspect the actually measured values. Without those, hard to see there's a bug... More likely, the measurement results are just too noisy to claim there has been a significant change in the result --- i.e. not changed significantly.
OK, it could of course be that it was intentional - forget the word "bug" then.
But if something increases dramatically (noisy or not), most people likely really would like to know (and would be able to determine themselves if the noise is relevant or not), rather than falsely be placated by reading BENCHMARKS NOT SIGNIFICANTLY CHANGED.
I mean, isn't that the whole point of the cut-off (e.g.) -f 1.1
? To throw away small small differences mostly due to noise? If something is suddenly 67% slower, I think that's relevant to know.
Because of the noise, you don't know if something increased dramatically: the values printed are only the medians. If there's too much spread in the results to even state with confidence the two measurements gave different results, it's not possible to state there was a change larger than some ratio.
There can be fine tuning of the statistics, but for that specific results with the measured values would be needed.
Sorry for the late reply.
Because of the noise, you don't know if something increased dramatically: the values printed are only the medians. If there's too much spread in the results to even state with confidence the two measurements gave different results, it's not possible to state there was a change larger than some ratio.
My point is that - despite the spread - the increase in the medians might be a real regression, and hiding those cases by default, and without an option to show them gives a false sense of security to the people using ASV. Maybe someone wants to be strict on those noisy cases as well...?
How about a parameter that allows to include these tests, either through a boolean, or through setting some threshold for the variance etc.?
If the increase of median is not statistically significant, it is not statistically significant, and most likely just spurious reading. You will also get false positives this way.
The option to show everything already exists: asv continuous --no-only-changed
.
What you probably are asking for is an option to provide a three-state report, change/no change/unsure.
An option to simply ignore statistics can also be added in principle.
ASV sometimes adds a
~
to ratios if there are many sample (with high variance, I'm guessing), but there's a bug somewhere how these results are evaluated when the significant changes are given out.As an example from running the ASV in a pandas PR:
Capital letters telling me everything is fine? Not so much unfortunately... (stars added manually)
It just so happened that all the significant changes were hiding behind a tilde...
PS. I just installed from github a few days ago, I'm on
0.4.dev1533+5f9be296
.