probcomp / bayeslite

BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
http://probcomp.csail.mit.edu/software/bayesdb
Apache License 2.0
922 stars 64 forks source link

Implement BQL syntax for multiple/joint imputation #458

Open fsaad opened 8 years ago

fsaad commented 8 years ago

Currently

INFER EXPLICIT PREDICT x, PREDICT y FROM p

imputes x and y from their marginals. We need BQL syntax for imputing from the joint distribution

INFER EXPLICIT PREDICT (x,y), PREDICT y FROM p

There are non-trivial UI and implementation questions here -- i.e. how does the result look in a tabular form, since the first PREDICT is bivariate input-output which is not standard SQL.

I recall @riastradh-probcomp did some enormous hack to get PREDICT ... CONFIDENCE working.

fsaad commented 8 years ago

Of course a user can always manually impute using SIMULATE x, y GIVEN z FROM p which returns results from the specified distribution {x,y}|z, and then post-process in their favorite way. But BayesDB (or as it stands now IBayesDBMetamodel) should also have some default imputation strategy based on the stattypes.