dedupeio / dedupe-examples

Examples for using the dedupe library
MIT License

MySQL Example Hangs #11

Closed by deanthemachine 10 years ago

deanthemachine commented 10 years ago

Hi!

First of all, great work, this stuff is amazing...

Second, I'm trying to work through the MySQL example, and every time I get to the "clustering" section it just hangs: no CPU usage, no RAM usage, no errors, it just stops.

As best I can tell it's stopping on the "clustered_dupes" section below. Anyone else have this issue or any suggestions?

    c.execute("SELECT donor_id, city, name, "
              "zip, state, address, "
              "occupation, employer, person, block_id, smaller_ids "
              "FROM smaller_coverage "
              "INNER JOIN processed_donors "
              "USING (donor_id) "
              "ORDER BY (block_id)")

    print 'clustering...'
    clustered_dupes = deduper.matchBlocks(candidates_gen(c), threshold=0.5)
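One way to narrow down where the stall happens is to drain the row generator on its own, without calling matchBlocks at all. The sketch below is only an illustration under assumptions: `group_by_block` and `count_blocks` are hypothetical helpers, not the example's actual `candidates_gen`, and the sample rows stand in for what the cursor would return. It is written for Python 3, while the script above uses Python 2 syntax.

```python
from itertools import groupby
from operator import itemgetter


def group_by_block(rows):
    """Yield (block_id, records) for rows that are already sorted by block_id."""
    for block_id, block_rows in groupby(rows, key=itemgetter("block_id")):
        yield block_id, list(block_rows)


def count_blocks(rows):
    """Consume the generator fully and return (number of blocks, number of rows)."""
    n_blocks = 0
    n_rows = 0
    for _, records in group_by_block(rows):
        n_blocks += 1
        n_rows += len(records)
    return n_blocks, n_rows


# Stand-in rows (hypothetical); in the real script these would come from the cursor.
sample_rows = [
    {"donor_id": 1, "block_id": 10},
    {"donor_id": 2, "block_id": 10},
    {"donor_id": 3, "block_id": 11},
]

print(count_blocks(sample_rows))
```

If a loop like this finishes quickly against the real cursor, the query and the generator are probably fine and the stall is more likely inside the clustering call. If it never returns, the query itself is the first thing to look at.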

fgregg commented 10 years ago

Hmm.. I'll check into it. What version of dedupe are you running?

deanthemachine commented 10 years ago

0.6.1 at the moment. Thanks!

fgregg commented 10 years ago

Hmm.. can't reproduce. Can you give us more details about your machine?

Can you also try setting num_cores = 1?
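A hang with no CPU usage is consistent with the process waiting on something, which is one reason a single-core run is a useful test. To collect more detail about where the main process is stuck, a watchdog from the standard library `faulthandler` module (Python 3.3 and later) can dump the stack after a timeout. This is a minimal sketch, not part of the dedupe example: `run_with_watchdog` is a hypothetical helper, and the lambda stands in for the clustering call.

```python
import faulthandler
import sys


def run_with_watchdog(func, timeout_s, log_file=sys.stderr):
    """Call func(); if it is still running after timeout_s seconds,
    write the stack of every thread in this process to log_file."""
    faulthandler.dump_traceback_later(timeout_s, exit=False, file=log_file)
    try:
        return func()
    finally:
        # Cancel the pending dump once func() has returned or raised.
        faulthandler.cancel_dump_traceback_later()


# Stand-in for the slow step (hypothetical); the real call would be the
# clustering step wrapped in a function or lambda.
result = run_with_watchdog(lambda: sum(range(1000)), timeout_s=60)
```

The dump only covers the process that called it, so stacks of any worker processes would not appear. That limitation is another reason to repeat the run on a single core first.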

On Fri, Aug 29, 2014 at 8:00 AM, deanthemachine notifications@github.com wrote:

> 0.6.1 at the moment. Thanks!


deanthemachine commented 10 years ago

Thanks Forest - I'm going to do a fresh install of all this and just start from scratch.