moj-analytical-services / splink_demos

Interactive notebooks containing demonstration code of the splink library

Invalid Input Error: Levenshtein Function: 2nd argument too short #55

Closed msiemionCalistapw closed 2 years ago

msiemionCalistapw commented 2 years ago

I am running the example model on my own data source and am getting this error: "RuntimeError: Invalid Input Error: Levenshtein Function: 2nd argument too short".

My comparison settings are: levenshtein_at_thresholds("Claim_description", 2),

It looks like the code is also failing on this line: linker.estimate_u_using_random_sampling(target_rows=1e6)

ThomasHepworth commented 2 years ago

Heya, could you please post any questions or queries you have about splink here?

If the question relates specifically to these demos, though, please do feel free to post it in this repository.

Thanks!

RobinL commented 2 years ago

See https://github.com/moj-analytical-services/splink/discussions/699#discussioncomment-3351927
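
For readers landing on this issue, here is a minimal sketch of one possible workaround. It assumes (this is not confirmed in the thread itself) that the error is triggered by empty strings in the compared column, which some DuckDB versions appear to reject as Levenshtein arguments. The idea is to turn blank values into nulls before the data reaches the linker. The helper names `blank_to_none` and `clean_records` and the sample rows are hypothetical and for illustration only; only the `Claim_description` column name comes from the report above.

```python
from typing import Any, Dict, Iterable, List


def blank_to_none(value: Any) -> Any:
    """Return None for empty or whitespace-only strings; pass other values through."""
    if isinstance(value, str) and value.strip() == "":
        return None
    return value


def clean_records(
    records: Iterable[Dict[str, Any]], columns: Iterable[str]
) -> List[Dict[str, Any]]:
    """Return copies of the records with blank strings in `columns` replaced by None."""
    target = set(columns)
    return [
        {key: (blank_to_none(val) if key in target else val) for key, val in row.items()}
        for row in records
    ]


# Made-up sample rows; the second and third have blank descriptions.
records = [
    {"unique_id": 1, "Claim_description": "water damage to kitchen"},
    {"unique_id": 2, "Claim_description": ""},
    {"unique_id": 3, "Claim_description": "   "},
]

# Assumption: nulls are skipped by the comparison, whereas empty strings raise.
cleaned = clean_records(records, columns=["Claim_description"])
```

After cleaning, the rows can be loaded into a DataFrame and passed to the linker as usual. Whether this resolves the error depends on the assumption above holding for your data and DuckDB version.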