cladteam / CCDA_OMOP_by_Python


Uber CLAD DQ MVP checks, SQL and visualizations #95

Open chrisroederucdenver opened 2 months ago

chrisroederucdenver commented 2 months ago

See update comment at bottom.

See the MS.11 tab here. The MVP checks have a "YES" in the "Include in June Deliverable" column. ("June" there refers to something else.) https://docs.google.com/spreadsheets/d/19cs4hTpLw9maU8UuSIcaj_0JSfcqSKcLQ6ESHy5_yFo/edit?gid=319835440#gid=319835440

Also, since we'll be running on Foundry, these queries should be written in Spark SQL and be part of a/the pipeline.
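As a rough illustration of what one such count check might look like, here is a minimal sketch. It assumes (not confirmed in this thread) that the converted data lands in an OMOP-style `person` table with `year_of_birth` and `month_of_birth` columns. The query sticks to plain `GROUP BY` and `COUNT(*)`, so it should also be valid Spark SQL; `sqlite3` is only a local stand-in for trying it out, and the sample rows are made up.

```python
import sqlite3

# Plain ANSI SQL; the table and column names are assumed from the OMOP CDM,
# not taken from the actual Foundry pipeline.
BIRTH_MONTH_COUNTS_SQL = """
SELECT year_of_birth, month_of_birth, COUNT(*) AS person_count
FROM person
GROUP BY year_of_birth, month_of_birth
ORDER BY year_of_birth, month_of_birth
"""


def counts_by_birth_month(rows):
    """Load (person_id, year_of_birth, month_of_birth) rows into an
    in-memory database and return the grouped counts."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE person ("
            "person_id INTEGER, year_of_birth INTEGER, month_of_birth INTEGER)"
        )
        conn.executemany("INSERT INTO person VALUES (?, ?, ?)", rows)
        return conn.execute(BIRTH_MONTH_COUNTS_SQL).fetchall()
    finally:
        conn.close()


# Made-up sample data, for illustration only.
SAMPLE_ROWS = [(1, 1980, 5), (2, 1980, 5), (3, 1975, 12)]
```

In the pipeline itself the same SQL text would presumably be run by Spark rather than SQLite.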

Pre-tasks

Counts

Post Tasks

chrisroederucdenver commented 2 months ago

Consider how the CLAD DQ "post-linkage" checks differ from the PPRL checks Isa Francisco is doing, and how those differ from the evaluation plans that consider augmentation and concordance for the pilot project. There should be lots of overlap, and we don't want to duplicate work...

chrisroederucdenver commented 1 month ago

Priority from meeting today: First 3 could be started:

This is next:

We're not sure if we're getting Death data. Open Question.

chrisroederucdenver commented 1 month ago

Deceased Observation: https://www.hl7.org/ccdasearch/templates/2.16.840.1.113883.10.20.22.4.79.html
It uses templateId 2.16.840.1.113883.10.20.22.4.79 and SNOMED code 419099009 "Dead".
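To check whether a given document carries death data at all, something like the following could work. This is a minimal sketch using only the standard library: it looks for `observation` elements with the templateId above and a `value` coded 419099009, and returns the `effectiveTime/low` value, which is assumed here to hold the death date. The function name and the trimmed sample fragment are hypothetical, and real documents will have more structure around the observation.

```python
import xml.etree.ElementTree as ET

NS = {"hl7": "urn:hl7-org:v3"}  # C-CDA default namespace
DECEASED_TEMPLATE_ID = "2.16.840.1.113883.10.20.22.4.79"
SNOMED_DEAD = "419099009"


def find_deceased_observations(xml_text):
    """Return the effectiveTime/low value (or None if absent) for each
    Deceased Observation found in the document."""
    root = ET.fromstring(xml_text)
    death_dates = []
    for obs in root.iter("{urn:hl7-org:v3}observation"):
        template_roots = {t.get("root") for t in obs.findall("hl7:templateId", NS)}
        if DECEASED_TEMPLATE_ID not in template_roots:
            continue
        value = obs.find("hl7:value", NS)
        if value is None or value.get("code") != SNOMED_DEAD:
            continue
        low = obs.find("hl7:effectiveTime/hl7:low", NS)
        death_dates.append(low.get("value") if low is not None else None)
    return death_dates


# Trimmed, made-up fragment for illustration; not a complete C-CDA document.
SAMPLE = """<ClinicalDocument xmlns="urn:hl7-org:v3">
  <observation classCode="OBS" moodCode="EVN">
    <templateId root="2.16.840.1.113883.10.20.22.4.79"/>
    <value code="419099009" codeSystem="2.16.840.1.113883.6.96" displayName="Dead"/>
    <effectiveTime><low value="20200115"/></effectiveTime>
  </observation>
</ClinicalDocument>"""
```

An empty result across all incoming documents would be one signal that we are not getting Death data.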

chrisroederucdenver commented 1 month ago

@AdamLeeIT Hello!

chrisroederucdenver commented 1 month ago

tagging @bryanlaraway for FYI

tannerzhang commented 1 month ago

Adding James Cavallon for the DQ work per Steph's direction. I am looking for James's GitHub handle.

chrisroederucdenver commented 1 month ago

See #118 about the back-end queries. See #112 (closed/deferred for now) for the inevitability of Site IDs. Site IDs are an open question, but they will impact the visualization. The back-end should be modified to report a single site, "#1", and the viz should be able to handle multiple values when/if they arrive. Grouped bar charts might work better for this than a heat-map. That seems OK because my current guess is that we won't have 100 sites, but it is just a guess.
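One way to keep the viz ready for more sites is to have it always consume per-site series, even while there is only site "#1". The sketch below is a hypothetical shape for that, not the actual back-end: the row keys `check`, `count`, and `site_id` are made-up names, and rows without a site ID fall back to "#1".

```python
from collections import defaultdict

DEFAULT_SITE_ID = "#1"  # single placeholder site until real Site IDs arrive


def group_counts_by_site(rows):
    """Pivot rows into (checks, series) for a grouped bar chart.

    rows: iterable of dicts with 'check', 'count', and optional 'site_id'
    (hypothetical field names). series maps site_id -> list of counts
    aligned with checks; missing combinations are filled with 0.
    """
    checks = []
    by_site = defaultdict(dict)
    for row in rows:
        site = row.get("site_id") or DEFAULT_SITE_ID
        check = row["check"]
        if check not in checks:
            checks.append(check)
        by_site[site][check] = by_site[site].get(check, 0) + row["count"]
    series = {
        site: [counts.get(check, 0) for check in checks]
        for site, counts in sorted(by_site.items())
    }
    return checks, series
```

Each key in `series` becomes one bar group member per check, so a second site shows up as a second bar without any change to the chart code.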

AdamLeeIT commented 3 weeks ago

Updated data to Slate for CCDA DQ checks.

chrisroederucdenver commented 3 weeks ago

Looks like Matt fixed the Postgres sync for ccda_counts_by_birth_month, so I marked it done.