jedp / porter-stemmer

Martin Porter's stemmer for node.js
MIT License
100 stars 15 forks source link

Stemmer logic is not compliant with Porter Stemmer #5

Open josephcc opened 6 years ago

josephcc commented 6 years ago

"dry aged" should become "dri age" (see references), but this library produces "dry ag"

http://textanalysisonline.com/nltk-porter-stemmer http://text-processing.com/demo/stem/

only-cliches commented 6 years ago

Actually it is compliant, those websites are the ones that aren't compliant.

Check out the official example lists provided on this page.

The word "dry" is actually on the list, and it gets converted to "dry" according to the test list. The same is true for "age", it's a word provided on the list and the library conforms correctly, while the links you provided do not.

References: Input list, Output list