dedupeio / dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
https://docs.dedupe.io
MIT License
4.11k stars 551 forks source link

Model fields that require a corpus cannot be searched for values not in the corpus #422

Closed evz closed 8 years ago

evz commented 8 years ago

When using a Gazetteer instance to find matches in a dataset that has a model that uses a field that requires a corpus, if the value being searched for is not already in the corpus, Dedupe will return an error.

fgregg commented 8 years ago

Closed by https://github.com/datamade/fuzzycategory/commit/775db757ac4ba5bb710b46a2c52eefe94d19f08f