DSACMS / dedupliFHIR

Prototype for basic deduplication and aggregation of eCQM data
Creative Commons Zero v1.0 Universal
8 stars 0 forks source link

Allow Configure of Blocking Settings Via JSON file #52

Closed IsaacMilarky closed 2 months ago

IsaacMilarky commented 2 months ago

Allow Configure of Blocking Settings Via JSON file

Problem

Previously, it was not obvious where the blocking settings for the splink linker were defined in the CLI.

Solution

Now, there is a new file called splink_settings.json to define some of the settings of the splink linker object.

Ex:

{
    "link_type": "dedupe_only",
    "blocking_rules_to_generate_predictions": [
         "birth_date",
        ["ssn", "birth_date"],
        ["ssn", "street_address"],
        "phone"
    ],
    "max_iterations": 20,
    "em_convergence": 0.01
}