OCR-D / zenhub

Repo for developing zenhub integration
Apache License 2.0
0 stars 0 forks source link

Decide on recommendation for edit distance algorithm #149

Open mweidling opened 1 year ago

mweidling commented 1 year ago

In the QA specs we claim that we provide a recommendation for the edit distance algorithm to be used.

For this we have to

bertsky commented 1 year ago

FYI I did some comprehensive testing of Python-available character alignment packages roughly 5 yrs ago, and the most important result was that you absolutely cannot trust but need to test them, thoroughly. Issues:

Tools besides rapidfuzz and jellyfish you might want to list:

mweidling commented 1 year ago

That's very valuable information, thank you! I will keep this in mind when we think about a recommendation (if we can break it down to one, actually).