TresAmigosSD / SPDA

SMV Public Data App

Problem clarification #1

jackyact closed 8 years ago

jackyact commented 8 years ago

The driver table for the hospital master is from Medicare, "Hospital General Information" (DF1). The crosswalk table is from Dartmouth, "hosp_hsa_hrr_2013" (DF2). The task is to link DF1 and DF2 by fuzzy matching on name, address, etc.

Questions: both tables have provider_ID, a 5-digit unique ID. A quick analysis shows:

If this is the case, are we applying the fuzzy match algorithm to the unmatched records only? Thanks,
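
For reference, the two-stage idea in the question (exact join on provider_ID first, fuzzy matching only for the leftovers) could be sketched roughly as below. This is only an illustration, not the project's code: the record layout (`provider_id`, `name`, `address` keys), the `link_records` and `similarity` helpers, and the 0.85 threshold are all assumptions, and the standard-library `difflib.SequenceMatcher` stands in for whatever fuzzy-match algorithm is eventually chosen.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and collapse whitespace so formatting differences matter less."""
    return " ".join(text.lower().split())


def similarity(a, b):
    """Return a 0..1 similarity ratio between two strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def link_records(df1, df2, threshold=0.85):
    """Link two lists of dict records (hypothetical layout).

    Stage 1: exact match on provider_id.
    Stage 2: fuzzy match on name + address, only for records left over.
    Returns (exact_ids, fuzzy_links), where fuzzy_links holds
    (df1_id, df2_id, score) tuples.
    """
    df2_ids = {rec["provider_id"] for rec in df2}

    # Stage 1: split DF1 into exact matches and leftovers.
    exact_ids = [r["provider_id"] for r in df1 if r["provider_id"] in df2_ids]
    matched = set(exact_ids)
    unmatched = [r for r in df1 if r["provider_id"] not in matched]

    # Only DF2 records not already linked by ID are fuzzy-match candidates.
    candidates = [r for r in df2 if r["provider_id"] not in matched]

    # Stage 2: keep the best-scoring candidate if it clears the threshold.
    fuzzy_links = []
    for rec in unmatched:
        key = rec["name"] + " " + rec["address"]
        best, best_score = None, 0.0
        for cand in candidates:
            score = similarity(key, cand["name"] + " " + cand["address"])
            if score > best_score:
                best, best_score = cand, score
        if best is not None and best_score >= threshold:
            fuzzy_links.append((rec["provider_id"], best["provider_id"], best_score))

    return exact_ids, fuzzy_links
```

If almost every record already matches on provider_ID, stage 2 only ever sees a handful of rows, so the choice of fuzzy algorithm matters much less.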

ninjapapa commented 8 years ago

That is great! Sounds like CMS has made great progress on their data! No need to do the fuzzy match anymore. We need to rethink the goal of this project.