cladteam / CCDA_OMOP_by_Python

2 stars 3 forks source link

Uber Coverage and Coverage Report Issue #82

Open chrisroederucdenver opened 2 months ago

chrisroederucdenver commented 2 months ago

The following issues are all tied-together with the ideas of identifying documents and sections and what they contain, and then checking if it made it into OMOP or not.

The idea is that on-arrival, we know what we should be mining out of a new document. On processing, we should be able to see what we have and haven't gotten out of it, and in some cases an error message relating to the cause. In some cases shortfalls will be about odd data or bugs in the code. In other cases, likely much more uniform, it will be about code that hasn't been written.

Some of those tickets are just evolving ideas, others are ToDos:

That much is just data. The table needs schema, queries and visualization. Sketches below.

See also https://docs.google.com/document/d/1EEFeiAuD1LoFvgB3RGhIzVfrHIqsBg8gLVcd0rJHPk4/edit#heading=h.7xwnrv99okb6

chrisroederucdenver commented 2 months ago

There's a follow-on to the Coverage report that goes beyond looking at processing errors or development progress, and considers the "garbage" question. Of the data elements that didn't make it into OMOP which are left behind because there's not an obvious place to put them where they will be found?

A follow-on to that is to evaluate the utility of that data and consider future work that would make it available.

More process-related info like the longer list of providers involved in care, the distinction between medication administration and medication request (order, prescription) will be there. Maybe marital status and religious affiliation. I'm not even thinking about stuff that DOES have a place in OMOP like the ADT transfers (?) between units in the hospital (I barely know this stuff). And things that are less likely to make a de-identified database like names, addresses and relatives.

chrisroederucdenver commented 2 months ago

@AdamLeeIT not to "Squirrel!" a rabbit hole here, but there will be a third branch to the comparison when we get EHR data from AoU for the same patients we're querying from the HIE networks. So we'll have OMOP data from patients that AoU acquired by other sources and we can compare to that.

chrisroederucdenver commented 1 month ago

In the FHIR group meetings Stephanie mentioned traceability from resulting OMOP IDs back to source IDs in FHIR or CCDA. This ties directly into the linkage mentioned here for a coverage report. @AdamLeeIT The focus here is to be able to compare the two, but the same data will allow working back from resulting OMOP into the source data.