Closed vojtechhuser closed 3 years ago
Example values for such an approach:
| measure_id | median | percentile10 | min | max |
|---|---|---|---|---|
| DrugEra:ConceptCnt | 1669 | 593.4 | 0 | 1894 |
| DrugExposure:ConceptCnt | 11813 | 1636.2 | 1461 | 41084 |
The DataQuality package provides more reference values (from 10 datasets): https://github.com/OHDSI/StudyProtocolSandbox/blob/master/DataQuality/inst/csv/empiric_reference.csv
A possible approach to rules may look like this: https://github.com/OHDSI/StudyProtocolSandbox/blob/master/DataQuality/inst/csv/empiric_rule.csv
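To make the idea concrete, here is a minimal sketch of how a notification-grade rule could use such reference values. The rule itself ("notify when the observed value falls below the 10th percentile of the reference datasets") is an assumption for illustration, not necessarily what empiric_rule.csv encodes, and the names `Reference`, `REFERENCE`, and `check_measure` are hypothetical. The numbers are the two example rows given above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Reference:
    """Summary of one measure across the reference datasets."""
    median: float
    percentile10: float
    minimum: float
    maximum: float


# Example values quoted in this issue (not the full empiric_reference.csv).
REFERENCE = {
    "DrugEra:ConceptCnt": Reference(1669, 593.4, 0, 1894),
    "DrugExposure:ConceptCnt": Reference(11813, 1636.2, 1461, 41084),
}


def check_measure(measure_id: str, observed: float) -> Optional[str]:
    """Return a notification message, or None if nothing is flagged.

    Assumed rule: flag values below the reference 10th percentile.
    Measures without reference values are not checked.
    """
    ref = REFERENCE.get(measure_id)
    if ref is None:
        return None
    if observed < ref.percentile10:
        return (
            f"NOTIFICATION: {measure_id} = {observed} is below the "
            f"10th percentile of reference datasets ({ref.percentile10})"
        )
    return None
```

For example, `check_measure("DrugEra:ConceptCnt", 400)` would return a notification, while an observed count of 1000 would pass silently.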
Heel has been superseded by DQD and is no longer under development.
Assuming thresholds can be changed dynamically (see other related issues on GitHub about this), the ongoing DQ study is using the following methodology to arrive at empiric (notification-grade) thresholds.
From the draft DQ study manuscript:
Also, another paste is
What would be a better approach? (Again, this is meant for notifications and for the initial Heel report.) The best scenario is one where DQA thresholds can be tweaked by the data customer, even per database or database type.
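One way such tweakable thresholds could be organized is a layered lookup: a per-database override wins over a per-database-type override, which wins over the empiric default. The sketch below is only an illustration of that precedence. All names (`DEFAULT_THRESHOLDS`, `BY_DATABASE_TYPE`, `BY_DATABASE`, `resolve_threshold`), the database labels, and the override values are hypothetical; only the two default values come from the percentile10 figures quoted earlier in this issue.

```python
from typing import Optional

# Empiric defaults: the 10th-percentile values quoted in this issue.
DEFAULT_THRESHOLDS = {
    "DrugEra:ConceptCnt": 593.4,
    "DrugExposure:ConceptCnt": 1636.2,
}

# Hypothetical overrides; the labels and numbers are made up for illustration.
BY_DATABASE_TYPE = {
    "claims": {"DrugExposure:ConceptCnt": 2000.0},
}
BY_DATABASE = {
    "my_site_db": {"DrugEra:ConceptCnt": 300.0},
}


def resolve_threshold(
    measure_id: str,
    database: Optional[str] = None,
    database_type: Optional[str] = None,
) -> Optional[float]:
    """Pick the most specific threshold available for a measure.

    Precedence: database override, then database-type override,
    then the empiric default. Returns None if no threshold is known.
    """
    layers = (
        BY_DATABASE.get(database, {}),
        BY_DATABASE_TYPE.get(database_type, {}),
        DEFAULT_THRESHOLDS,
    )
    for layer in layers:
        if measure_id in layer:
            return layer[measure_id]
    return None
```

With this layout, a data customer only has to list the measures they want to change; everything else falls back to the shared empiric defaults.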