DSACMS / dedupliFHIR

Prototype for basic deduplication and aggregation of eCQM data
Creative Commons Zero v1.0 Universal

Check Inputs for Presence of Unique Blocking Values #60

Closed IsaacMilarky closed 3 weeks ago

IsaacMilarky commented 4 weeks ago

Problem

Certain inputs cause errors that are hard to diagnose. Further details are on the issue page for #58.

Solution

Validate the contents of the inputs before processing them. In this case, the problem was that the data contained no SSNs.

Result

I have added a pre-parse step to the dedupliFHIR CLI that raises an assertion error if any of the blocking rules has fewer than 5 unique values. The proper user response to this error is to adjust the blocking rules defined in the settings file.
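
For reviewers, here is a minimal sketch of the kind of check described above. It is not the code in this PR: the function name `check_blocking_columns`, the list-of-dicts record format, and the idea that each blocking rule maps to a single column name are all assumptions made for illustration. Only the threshold of 5 and the assertion error come from the description.

```python
MIN_UNIQUE_VALUES = 5  # threshold stated in this PR


def check_blocking_columns(records, blocking_columns, min_unique=MIN_UNIQUE_VALUES):
    """Raise AssertionError if any blocking column has too few unique values.

    Hypothetical helper: `records` is a list of dicts and `blocking_columns`
    is a list of column names. Both shapes are assumptions, not the CLI's API.
    """
    for column in blocking_columns:
        values = set()
        for record in records:
            value = record.get(column)
            # Treat None and whitespace-only strings as missing.
            if value is None or str(value).strip() == "":
                continue
            values.add(str(value).strip())
        if len(values) < min_unique:
            raise AssertionError(
                f"Blocking column '{column}' has {len(values)} unique non-blank "
                f"value(s); at least {min_unique} are required. "
                "Adjust the blocking rules in the settings file."
            )


# Illustrative data only: every SSN is blank, last names are all distinct.
records = [
    {"ssn": "", "last_name": name}
    for name in ["Ng", "Diaz", "Okoye", "Silva", "Khan"]
]
check_blocking_columns(records, ["last_name"])  # passes silently
```

Calling `check_blocking_columns(records, ["ssn"])` on the same data raises the assertion error, which mirrors the missing-SSN case that motivated this change.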

Test Plan

Test on the given data, and also test with blank values in each of the other default blocking columns.