cancerDHC / operations

for operational functions

1c2 Evaluate data population of node data models (which portions are populated, whether they are consistently populated, etc.) #56

bfurner closed 3 years ago

bfurner commented 4 years ago

Evaluation of how data is populated and distributed within the node models has not yet begun. In April, we will reach out to the node data model contacts and ask them to provide the CCDH team with summary statistics showing how completely each node model is currently populated. These statistics will inform our ongoing mapping work, since we will not spend effort harmonizing data elements that are not populated.
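As a rough illustration of the kind of summary statistic meant here, the sketch below computes, for each field, the fraction of records that carry a usable value. It is only a minimal example under stated assumptions: the field names (`primary_diagnosis`, `tumor_stage`), the sample records, and the list of placeholder strings treated as "not populated" are all hypothetical and are not taken from any node model.

```python
from collections import Counter

# Placeholder strings counted as "not populated".
# This list is an assumption for illustration, not a node convention.
EMPTY_STRINGS = {"", "not reported", "unknown"}


def is_populated(value):
    """Return True if a value should count as populated."""
    if value is None:
        return False
    if isinstance(value, str):
        return value.strip().lower() not in EMPTY_STRINGS
    return True


def population_profile(records, fields):
    """Return, per field, the fraction of records with a populated value."""
    total = len(records)
    filled = Counter()
    for record in records:
        for field in fields:
            if is_populated(record.get(field)):
                filled[field] += 1
    return {f: (filled[f] / total if total else 0.0) for f in fields}


# Hypothetical records; real node exports will look different.
records = [
    {"primary_diagnosis": "Adenocarcinoma", "tumor_stage": "not reported"},
    {"primary_diagnosis": "", "tumor_stage": "Stage II"},
    {"primary_diagnosis": "Glioblastoma"},
]
profile = population_profile(records, ["primary_diagnosis", "tumor_stage"])
for field, rate in profile.items():
    print(f"{field}: {rate:.0%} populated")
```

A per-field rate like this would make it easy to set a cutoff below which a data element is deferred from harmonization, though any such cutoff would be a team decision rather than something this sketch prescribes.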

bfurner commented 3 years ago

The data model harmonization team has produced summary statistics that illustrate the current state of PDC and GDC open access data sets, in the subdomains that are presently part of the CRDC-H. We will expand on these profiles as our model work continues and as the DC models themselves evolve.

GDC data profiles are located here

PDC data profiles are located here