Open tibbetts opened 8 years ago
One example, which was checked in as https://github.com/probcomp/bdbcontrib/blob/54d3025c7bd930d77d3fa3d25c89f414d611b9d8/examples/satellites/Satellites.ipynb, where longitude_in_radians_of_geo is not clustered with geopolitics.
Additional instances of potential instability:
@riastradh-probcomp says (in probcomp/bdbcontrib#42, which is a duplicate of this): (a) We need to determine how to assess the stability of phenomena for our demos. (b) We need to find stable phenomena for our demos. (c) We need to automatically test these in our demos.
Hypothesis: all the predictive probability of Orion 6, SDS-III 6, and SDS-III 7 periods comes from positing that they are singletons
More outstanding questions on what exactly we want to validate:
There is now some stability code in bdbcontrib/examples/satellites -- thanks @axch!
What it lacks (apart from being a particular set of queries that may or may not stay in the notebook) is a set of boundary conditions on what is acceptable. Suggestion is to look at the existing values, assert that they won't get (much) bigger, and push on those boundaries until the tests are no longer flaky, and at the same time we're still seeing the answers we want.
checking that statements in the text of the notebook stay true. Specifically: