Closed deanthemachine closed 10 years ago
Hmm.. I'll check into it. What version of dedupe are you running?
0.6.1 at the moment. Thanks!
Hmm.. can't reproduce. Can you give us more details about your machine?
Can you also try setting num_cores = 1?
Thanks Forest - I'm going to do a fresh install of all this and just start from scratch.
Hi!
First of all, great work, this stuff is amazing...
Second, I'm trying to work through the MySQL example, and every time I get to the "clustering" section it just hangs: no CPU or RAM usage, no errors, it just stops.
As best I can tell it's stopping on the "clustered_dupes" section below. Anyone else have this issue or any suggestions?
    c.execute("SELECT donor_id, city, name, "
              "zip, state, address, "
              "occupation, employer, person, block_id, smaller_ids "
              "FROM smaller_coverage "
              "INNER JOIN processed_donors "
              "USING (donor_id) "
              "ORDER BY (block_id)")

    print 'clustering...'
    clustered_dupes = deduper.matchBlocks(candidates_gen(c), threshold=0.5)
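A rough sketch for narrowing down a stall like this, assuming the cursor returns dict-like rows already sorted by block_id (as the ORDER BY above does): group the rows into blocks with a generator that logs as it goes, and iterate it on its own before passing anything to matchBlocks. If the log stops advancing, the stall is probably in the query or the fetch; if it runs to the end, the stall is more likely inside the clustering step. Note that `iter_blocks`, `report_every`, and the sample rows are hypothetical names made up for illustration. This is not the example's own `candidates_gen`, and it is written to run on its own without a database.

```python
import itertools
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("blocks")


def iter_blocks(rows, key="block_id", report_every=1000):
    """Yield (block_id, rows_in_block) pairs, logging progress.

    Hypothetical helper: assumes `rows` is an iterable of dict-like
    records that is already sorted by `key`.
    """
    count = 0
    for block_id, group in itertools.groupby(rows, key=lambda r: r[key]):
        block = list(group)  # pull the whole block off the cursor
        count += 1
        if count % report_every == 0:
            log.info("%d blocks read, last block_id=%s", count, block_id)
        yield block_id, block


# Stand-in rows; in the real script these would come from the cursor.
rows = [
    {"donor_id": 1, "block_id": 10},
    {"donor_id": 2, "block_id": 10},
    {"donor_id": 3, "block_id": 11},
]

# Drain the generator by itself to check that the rows actually arrive.
blocks = list(iter_blocks(rows, report_every=1))
```

Draining the generator separately like this only checks the read side. It says nothing about what matchBlocks does internally, so the num_cores = 1 suggestion is still worth trying alongside it.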