National-Clinical-Cohort-Collaborative / Data-Ingestion-and-Harmonization

Zip Codes management across all CDMs #40

Open DaveraGabriel opened 4 years ago

DaveraGabriel commented 4 years ago

Zip code is not a required field in every CDM. What remedy should be implemented when the zip code is absent? 1) Substitute the zip code of the health system / data partner? 2) Leave the field blank / null? 3) Ask the sites to produce zip code fields?

hlehmann17 commented 4 years ago

Since analysts will presume that the zip code "belongs" to the individual, I think we do a disservice by imputing anything. Providing a site zip code would also only increase the likelihood of institution reidentification.

DaveraGabriel commented 4 years ago

Related to issue #48, which was closed for consolidation into this one: "Per mapping validation review - the TriNetX data will likely always have 5 digit zip codes - will require consistent reduction to leading 3 digits"
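
For discussion, here is a minimal Python sketch of the "reduce to leading 3 digits, never impute" behavior described above. It is not the pipeline's actual code. The function name `to_zip3` is hypothetical, and several choices are assumptions rather than anything decided in this thread: values are handled as strings so that leading zeros survive, ZIP+4 values and already-truncated 3-digit values are accepted, and anything missing or malformed becomes null instead of being replaced with a site zip code.

```python
import re
from typing import Optional

# Assumption: accept a 5-digit zip, optionally followed by a +4 extension.
_ZIP5 = re.compile(r"([0-9]{5})(?:-?[0-9]{4})?")
# Assumption: a site may already send the 3-digit prefix.
_ZIP3 = re.compile(r"([0-9]{3})")


def to_zip3(raw: Optional[str]) -> Optional[str]:
    """Return the leading 3 digits of a zip code, or None if absent/invalid.

    Hypothetical helper: no value is ever imputed, so a missing or
    malformed zip code stays null.
    """
    if raw is None:
        return None
    value = raw.strip()
    match = _ZIP5.fullmatch(value) or _ZIP3.fullmatch(value)
    if match is None:
        return None
    # Keep the result as a string so a prefix such as "021" keeps its zero.
    return match.group(1)[:3]


if __name__ == "__main__":
    for sample in ["21205", "02139-4307", "212", "", None, "N/A"]:
        print(repr(sample), "->", repr(to_zip3(sample)))
```

Returning null for unparseable input keeps the field consistent with option 2 above; whether malformed values should instead be flagged back to the site is left open.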