National-Clinical-Cohort-Collaborative / Data-Ingestion-and-Harmonization

We are missing SARS-CoV-2 Test Results for Site 6000 #67

Closed agirvin closed 3 years ago

agirvin commented 4 years ago

We are currently missing SARS-CoV-2 Test results for sites 7000 (known) and 6000 (possibly not known).

This is the breakdown of test type by site:

image

And these are the result values:

image

As was known, the site 7000 values aren't mapped. But in the case of site 6000, we actually see the concept_id for the test itself in both the measurement_concept_id and value_as_concept_id columns.

image

That is, the measurement_concept_name is "SARS coronavirus 2 N gene [Presence] in Unspecified specimen by NAA with probe detection" and the value_as_concept_name is "SARS coronavirus 2 N gene [Presence] in Unspecified specimen by NAA with probe detection"
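A check like the following could flag this pattern, where a row's value_as_concept_id simply repeats its measurement_concept_id instead of pointing to a result concept. This is only a minimal sketch: the row layout, the `site_id` field name, and the concept ids are hypothetical placeholders, not the actual enclave schema or real concept ids.

```python
from collections import Counter


def find_self_referencing_results(rows):
    """Return rows whose value_as_concept_id repeats the measurement_concept_id.

    Rows with no value_as_concept_id (unmapped results) are not flagged here;
    they are a separate problem.
    """
    return [
        row
        for row in rows
        if row["value_as_concept_id"] is not None
        and row["value_as_concept_id"] == row["measurement_concept_id"]
    ]


# Hypothetical sample rows; 1001 and 2001 are placeholder ids, not real concepts.
sample_rows = [
    {"site_id": 6000, "measurement_concept_id": 1001, "value_as_concept_id": 1001},
    {"site_id": 6000, "measurement_concept_id": 1001, "value_as_concept_id": 1001},
    {"site_id": 1000, "measurement_concept_id": 1001, "value_as_concept_id": 2001},
    {"site_id": 7000, "measurement_concept_id": 1001, "value_as_concept_id": None},
]

flagged = find_self_referencing_results(sample_rows)

# Count flagged rows per site to see which sites are affected.
flagged_by_site = Counter(row["site_id"] for row in flagged)
print(dict(flagged_by_site))
```

With the sample rows above, only the site 6000 rows are flagged; the site 7000 row has no value concept at all, so it would need a separate check for unmapped results.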

The analysis is here

DaveraGabriel commented 4 years ago

@agirvin: the datastore you worked with yesterday was not ready or intended for your analyses. We are addressing the "publish to Palantir" steps / workflow, including a regular cadence (twice weekly) for enclave data refresh, to avoid this in the future. However, we are also looking into the information you have provided today. @rajuhemadri

stephanieshong commented 4 years ago

Corrected for site 7000: the qualitative results are now set in value_as_concept_id. As for site 6000, a new dataset will be requested with the corrected value_as_concept_id.