Computing statistics on generated test values

flyingmutant / rapid

Rapid is a modern Go property-based testing library

https://pkg.go.dev/pgregory.net/rapid

Mozilla Public License 2.0

579 stars 25 forks source link

Computing statistics on generated test values #30

Open alfert opened 2 years ago

alfert commented 2 years ago

In Hypothesis and other QuickCheck-like implementations, it is possible to calculate statistics, usually to validate that test data generation works as expected or is skewed somehow, as described here: https://hypothesis.readthedocs.io/en/latest/details.html#test-statistics

While it is easy to implement something like the event() function of Hypothesis for rapid, generating reports is not. There are no means for decorating a property (that I am aware of) and calling a PrintStats() function at the end of the property will be run every time (i.e. 100 times for a single property). What seems to work is the following:

func TestStackRapid(t *testing.T) {
        defer stats.PrintStats(t)
    rapid.Check(t, func(t *rapid.T) {
             ....
             stats.Event(fmt.Sprintf("some event"))
             ...
       }
}

Is that a hack or an intended way of decorating a property? And: are you interested in an implementation for such statistics?

flyingmutant commented 2 years ago

To be honest, I've never actually used such reporting functionality, so I do not understand this use case fully. Your code seems fine to me -- although defer might not be necessary:

func TestStackRapid(t *testing.T) {
    rapid.Check(t, func(t *rapid.T) {
             ....
             stats.Event(fmt.Sprintf("some event"))
             ...
       }
       stats.PrintStats(t)
}

As for implementing reporting inside rapid itself, right now I am not a fan: I think this is quite narrow use case because we should not show such things by default (I believe tests should be silent by default), and when it is hidden behind an option, not many people will benefit from it.

alfert commented 2 years ago

Thanks, to do the printing after calling Check is indeed a good idea.

The use case for such reporting is important when you define your own generators. How do you ensure that you generate good test data? You might do some test coverage to find out if your relevant code paths are selected. But what do you do if this is not the case? In that case you need to understand what your generator is producing, not only a few samples, but in your tests. In such situations, a reporting tool comes quite handy and is standard feature of QuickCheck from its inception in Haskell and its commercial version in Erlang. But also other implementations provide that functionality, e.g. Hypothesis (Python), PropEr (Erlang, Elixir), ScalaCheck (Scala).

So, reporting is a kind of debugging tool for test developers.

flyingmutant commented 2 years ago

Thanks for the explanation! Do you have in mind how it should look in rapid? Something close to Hypothesis?

alfert commented 2 years ago

Since we are close to Hypothesis and do not have the FP restrictions (and abilities), I would suggest to model it generally after Hypothesis. In my simple implementation, I added a Event(t *rapid.T, event string) function to collect events to a given test. That can be printed by above's PrintStats function. I use t.Logf for output, so a test -v will output the stats.

I can provide a PR for this.

flyingmutant commented 2 years ago

Sounds interesting, let's see the PR (can't promise fast review right now, unfortunately).