buruzaemon / natto-py

natto-py combines the Python programming language with MeCab, the part-of-speech and morphological analyzer for the Japanese language.
BSD 2-Clause "Simplified" License
92 stars 13 forks source link

Erroneous handling of whitespace trimming #112

Closed buruzaemon closed 4 years ago

buruzaemon commented 4 years ago

When stripping whitespace from input, we need to be sure to limit trimming only to half-width whitespace shars. Full-width whitespace is recognized and handled by MeCab, and so should be left intact.

Please see https://github.com/buruzaemon/natto-py/pull/111

buruzaemon commented 4 years ago

Based on recent patch from @himkt, corrected behavior when stripping newline chars from MeCab output. Releasing fix in 0.9.1.