Open chrisroederucdenver opened 2 months ago
There's a follow-on to the Coverage report that goes beyond looking at processing errors or development progress, and considers the "garbage" question. Of the data elements that didn't make it into OMOP which are left behind because there's not an obvious place to put them where they will be found?
A follow-on to that is to evaluate the utility of that data and consider future work that would make it available.
More process-related info like the longer list of providers involved in care, the distinction between medication administration and medication request (order, prescription) will be there. Maybe marital status and religious affiliation. I'm not even thinking about stuff that DOES have a place in OMOP like the ADT transfers (?) between units in the hospital (I barely know this stuff). And things that are less likely to make a de-identified database like names, addresses and relatives.
@AdamLeeIT not to "Squirrel!" a rabbit hole here, but there will be a third branch to the comparison when we get EHR data from AoU for the same patients we're querying from the HIE networks. So we'll have OMOP data from patients that AoU acquired by other sources and we can compare to that.
In the FHIR group meetings Stephanie mentioned traceability from resulting OMOP IDs back to source IDs in FHIR or CCDA. This ties directly into the linkage mentioned here for a coverage report. @AdamLeeIT The focus here is to be able to compare the two, but the same data will allow working back from resulting OMOP into the source data.
The following issues are all tied-together with the ideas of identifying documents and sections and what they contain, and then checking if it made it into OMOP or not.
81 is about a new table to track what rule and CCDA document a row in OMOP came from
75 is about #81 having a document ID to work with knowing what CCDA version it is
43 looks at the problem from the POV of new data arrival, or triage
46 (closed) is a duplicate
45 (abandoned) was originally about a google sheet to track progress of what we can process, the task matrix
51 (abandoned) is more about the task-matrix
48 asks us to investigate the logs and rule failures
32 also about the logs
130 in the Coverage Report, distinguish missing mappings
The idea is that on-arrival, we know what we should be mining out of a new document. On processing, we should be able to see what we have and haven't gotten out of it, and in some cases an error message relating to the cause. In some cases shortfalls will be about odd data or bugs in the code. In other cases, likely much more uniform, it will be about code that hasn't been written.
Some of those tickets are just evolving ideas, others are ToDos:
That much is just data. The table needs schema, queries and visualization. Sketches below.
See also https://docs.google.com/document/d/1EEFeiAuD1LoFvgB3RGhIzVfrHIqsBg8gLVcd0rJHPk4/edit#heading=h.7xwnrv99okb6