nickgrasley / ml-record-linking

Code for linking census records
0 stars 0 forks source link

Test `blocker.py` #23

Closed benbusath closed 4 years ago

benbusath commented 4 years ago

BlockDB fails with really large queries (several blocks, large tables etc.) without using merge keys, and throws SQL primary key error when ran with primary keys.

benbusath commented 4 years ago

Figured out issue with primary merge keys: you have to make sure to delete the blocking destinatino table before retrying blocking. Large queries can be solved with lowering the chunksize.