RobinL / fuzzymatcher

Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4
MIT License
281 stars 60 forks source link

Link table fails on NaN #27

Closed HaydenSansum closed 6 years ago

HaydenSansum commented 6 years ago

The link table function will fail and throw a Type error if there are any NaNs in the matching columns.

TypeError: ('sequence item 0: expected str instance, float found', 'occurred at index 10')

RobinL commented 6 years ago

code to fix is here https://github.com/RobinL/fuzzymatcher/blob/800d0de35188410d81e537a281b2b89a050b397e/fuzzymatcher/data_preprocessor_default.py#L53 best option probably to fillna on the columns before that line