catalyst-cooperative / ccai-entity-matching

An exploration of generalizable approaches to unsupervised entity matching for use in linking tabular public energy data sources.
MIT License
1 stars 2 forks source link

Integrate splink matching model into pipeline #32

Closed katie-lamb closed 8 months ago

katie-lamb commented 1 year ago

Currently the matching model I've built with splink is in a notebook. I'm going to integrate this into a matching module in the repo so that as we develop blocking methods we can run the candidate set through splink to evaluate how well it's performing.