morfologik / morfologik-stemming

Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.
BSD 3-Clause "New" or "Revised" License
187 stars 44 forks source link

Runon candidates #103

Closed danielnaber closed 4 years ago

danielnaber commented 4 years ago

This introduces replaceRunOnWordCandidates, similar to findReplacementCandidates (i.e. the result being data with distances, not just strings). It also modifies findReplacementCandidates so that at the end the distance is kept and not increased (CandidateData constructor uses frequency data, i.e. frequency data was used twice).