intrahealth / client-registry

Open Client Registry Service
https://openclientregistry.com
MIT License
22 stars 36 forks source link

Feature request: 'm' and 'u' values for probablistic matching #26

Closed mugootugoo closed 3 years ago

mugootugoo commented 4 years ago

How to select 'm' and 'u' values, if the feature is supported?

citizenrich commented 4 years ago

These are entered into the decision rules config file. They must be known beforehand.

Determining the proper values are left to the administrator of the system, but generally you'd need to explore what makes sense for your use case, minimum data set, and sample of the data to be matched. It's a statistical exercise that must be done externally.

There's interest on the part of the maintainers to make trying different values of these a feature within the system itself in the future. If you're interested in that let us know.

ashaban commented 3 years ago

@mugootugoo here is a template that shows how m and u values are defined https://github.com/intrahealth/client-registry/blob/master/server/config/decisionRulesProbabilisticTemplate.json