dedupeio / dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
https://docs.dedupe.io
MIT License
4.15k stars 551 forks source link

extend index predicates to whole model #1144

Open fgregg opened 1 year ago

fgregg commented 1 year ago

right now the index predicates work on the model, but it would be interesting, particularly for the bag-of-word predicates, to consider predicates that operated over the whole field.