lminer closed 9 years ago
These kinds of questions are better on the mailing list or Stack Overflow. Thanks!
Is there a specific tag that you use?
not yet. dedupe may be good. we keep an eye out :)
Done, although SO people don't seem to like it: https://stackoverflow.com/questions/31324582/how-do-you-make-a-gazetteer-for-dedupe-when-individuals-have-multiple-addresses
python-dedupe may be more informative going forward. We'll see.
Reading the documentation, it seems like a gazetteer needs to have clean, distinct individual-level data. What do you do if the individual has moved, changed jobs, etc. several times? Do you include multiple observations per individual, with the blanks intelligently filled in?
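To make that last option concrete, here is a minimal sketch in plain Python. It does not call dedupe at all, and nothing here is confirmed as the recommended approach. The field names, the `person_id` column, and the helpers `fill_blanks` and `to_gazetteer` are all hypothetical. It assumes the observations are ordered oldest first and that a blank should inherit the most recent earlier value for the same person.

```python
from collections import defaultdict

# Hypothetical raw observations: several rows per person, with gaps (None).
observations = [
    {"person_id": 1, "name": "Ann Lee", "address": "12 Oak St", "employer": "Acme"},
    {"person_id": 1, "name": "Ann Lee", "address": "98 Elm Ave", "employer": None},
    {"person_id": 2, "name": "Bob Ray", "address": None, "employer": "Initech"},
    {"person_id": 2, "name": "Bob Ray", "address": "5 Pine Rd", "employer": None},
]


def fill_blanks(rows):
    """Fill each missing field with the latest earlier value seen for that person."""
    last_seen = defaultdict(dict)  # person_id -> {field: last non-blank value}
    filled = []
    for row in rows:
        known = last_seen[row["person_id"]]
        new_row = {}
        for field, value in row.items():
            if value is None:
                value = known.get(field)  # stays None if nothing seen yet
            else:
                known[field] = value
            new_row[field] = value
        filled.append(new_row)
    return filled


def to_gazetteer(rows):
    """Give every observation its own record id; the person id stays as a field."""
    return {f"{row['person_id']}-{i}": row for i, row in enumerate(rows)}


gazetteer = to_gazetteer(fill_blanks(observations))
```

Each person then appears once per known address or job, so a messy record could match any of them, and the shared `person_id` lets the matches be collapsed back to one individual afterwards. Whether this is better than one row per person is exactly the open question.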