bmir-radx / radx-project

This repo serves as a primary location for tracking issues that don't quite fit into our other dedicated repositories
0 stars 0 forks source link

Cursory analysis on Tier-2 data elements from DCCs #34

Open marcosmro opened 8 months ago

marcosmro commented 8 months ago

@muakdogan: focus on RADx-UP. @jkyu: RADx-rad

The plan is to report on the results from this analysis during the next RADx harmonization meeting (8/11).

jkyu commented 7 months ago

We ended up not having the planned harmonization meeting. It was replaced with the cedar integration (into the RADx Data Hub) meeting that week, and we didn't go through the Tier-1/Tier-2 data dictionaries from the DCCs.

I worked on this just now in the context of expanding the harmonization rules currently implemented in the Harmonization Metrics Library, and I focused only on the variable name of each data element.

Referencing the Complete Data Variable Report from BAH, it looks like the Tier-2 data elements from rad and UP appear in both the origcopy and transformcopy. It could be that the harmonization mappings are idempotent for Tier-2 data elements (which would be great for us!) or that the mapping deals with the data element values while keeping the same variable name. This can be verified by looking at the origcopy and transformcopy of the data dictionaries submitted with each data file.

jkyu commented 5 months ago

Closing this out. See previous message.