dedupeio / dedupe-examples

:id: Examples for using the dedupe library
MIT License
406 stars 214 forks source link

Long Run Time in csv example #17

Closed xykev closed 9 years ago

xykev commented 9 years ago

I tried running the csv example and it seems to have a very long run time. I am able to enter my responses to the training questions with no problem but after I finish, the program appears to freeze.

With some testing it appears that entering 'f' a second time results in the following error message:

RuntimeError: Attempt to start a new process before the current process has finished its bootstrapping phase.

        This probably means that you are on Windows and you have
        forgotten to use the proper idiom in the main module:

            if __name__ == '__main__':
                freeze_support()
                ...

        The "freeze_support()" line can be omitted if the program
        is not going to be frozen to produce a Windows executable.

Any help would be appreciated!

EDIT: disabling multiprocessing by changing the line

dedupe.Dedupe(fields, num_cores=4)

seems to have fixed the problem as with my other issue.