Anders429 / word_filter

A Word Filter for filtering text.
Apache License 2.0

Feature for canonical words #76

Open Anders429 opened 3 years ago

Anders429 commented 3 years ago

Consider adding a feature for canonical words that allows storing the canonical word within Match states.

Currently, storing the canonical word is the default behavior. However, not all users call the current find() method, and those who do not are still stuck with a suboptimal PDA. Making canonical words optional would let some builds omit the word, so their Match states could be merged, resulting in smaller binary sizes.
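To illustrate why omitting the word enables merging, here is a minimal sketch. The `State` enum and the helper functions are hypothetical and are not the crate's actual types; the payload is modeled with a generic parameter rather than a real Cargo feature. When every Match state carries a distinct word, none of them are equal; once the word is dropped, they all collapse into one.

```rust
use std::collections::HashSet;
use std::hash::Hash;

/// A simplified, hypothetical automaton state (not the crate's real type).
/// `W` is the payload stored in a match state: `&'static str` when the
/// canonical word is kept, `()` when it is omitted.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
enum State<W> {
    /// A non-accepting state, identified by an arbitrary id.
    Intermediate(u32),
    /// An accepting state, carrying the payload `W`.
    Match(W),
}

/// Counts the distinct states left after identical states are merged.
fn merged_state_count<W: Eq + Hash>(states: Vec<State<W>>) -> usize {
    states.into_iter().collect::<HashSet<_>>().len()
}

/// One intermediate state plus three match states, each with its own word.
fn with_canonical_words() -> Vec<State<&'static str>> {
    vec![
        State::Intermediate(0),
        State::Match("foo"),
        State::Match("bar"),
        State::Match("baz"),
    ]
}

/// The same states with the canonical word dropped from each match state.
fn without_canonical_words() -> Vec<State<()>> {
    with_canonical_words()
        .into_iter()
        .map(|state| match state {
            State::Intermediate(id) => State::Intermediate(id),
            State::Match(_) => State::Match(()),
        })
        .collect()
}
```

With words stored, `merged_state_count(with_canonical_words())` leaves all four states distinct; without them, `merged_state_count(without_canonical_words())` leaves only two, since the three Match states become identical. In an actual build the choice would presumably be made with a Cargo feature and `#[cfg(feature = "...")]` rather than a type parameter, but that is an assumption about how the feature might be implemented.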

Additionally, consider renaming the find() method to find_canonical() to differentiate it from find_raw().