This PR attempts to add in the research done as part of the suicidality studies around propensity score matching as a standard statistical tool in the library.
This attempts sole to deal with the PSM logistics, but not the cli/study logistics. So here's what it's trying to do:
Provide a new table builder extension for PSM
Provide PSM specific jinja templates
Creates a toml input format for configuring PSM jobs
Ancilarily provides a conftest fixture for running tests with duckdb
It does not attempt to do the following, for cognitive load reasons, which will follow on later:
run PSM from a manifest
metadata table peristance for managing multiple PSM tables
cli commands around optionally running/cleaning PSM tables
PSM date filtering on cohorts, which needs some more definition
Checklist
[X] Consider if documentation (like in docs/) needs to be updated
[X] Consider if tests should be added
[X] Run pylint if you're making changes beyond adding studies
[X] Update template repo if there are changes to study configuration
This PR attempts to add in the research done as part of the suicidality studies around propensity score matching as a standard statistical tool in the library.
This attempts sole to deal with the PSM logistics, but not the cli/study logistics. So here's what it's trying to do:
It does not attempt to do the following, for cognitive load reasons, which will follow on later:
Checklist
docs/
) needs to be updated