probcomp / bayeslite

BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
http://probcomp.csail.mit.edu/software/bayesdb
Apache License 2.0
921 stars 63 forks source link

Create automated validation for Satellite analysis #199

Open tibbetts opened 8 years ago

tibbetts commented 8 years ago

checking that statements in the text of the notebook stay true. Specifically:

tibbetts commented 8 years ago

One example, which was checked in as https://github.com/probcomp/bdbcontrib/blob/54d3025c7bd930d77d3fa3d25c89f414d611b9d8/examples/satellites/Satellites.ipynb, where longitude_in_radians_of_geo is not clustered with geopolitics. image

axch commented 8 years ago

Additional instances of potential instability:

axch commented 8 years ago

@riastradh-probcomp says (in probcomp/bdbcontrib#42, which is a duplicate of this): (a) We need to determine how to assess the stability of phenomena for our demos. (b) We need to find stable phenomena for our demos. (c) We need to automatically test these in our demos.

axch commented 8 years ago

Hypothesis: all the predictive probability of Orion 6, SDS-III 6, and SDS-III 7 periods comes from positing that they are singletons

axch commented 8 years ago

More outstanding questions on what exactly we want to validate:

gregory-marton commented 8 years ago

There is now some stability code in bdbcontrib/examples/satellites -- thanks @axch!

What it lacks (apart from being a particular set of queries that may or may not stay in the notebook) is a set of boundary conditions on what is acceptable. Suggestion is to look at the existing values, assert that they won't get (much) bigger, and push on those boundaries until the tests are no longer flaky, and at the same time we're still seeing the answers we want.