Closed sorami closed 4 years ago
Apply the same fix as this PR https://github.com/WorksApplications/Sudachi/pull/118/ for Java implementation.
Related: #120
When there are more tokens than the original, due to the normalization, set the original to the first output token, not the last.
For example, currently,
$ echo … | sudachipy 補助記号,句点,*,*,*,* . 補助記号,句点,*,*,*,* . … 補助記号,句点,*,*,*,* . EOS
This will be fixed to
$ echo … | sudachipy … 補助記号,句点,*,*,*,* . 補助記号,句点,*,*,*,* . 補助記号,句点,*,*,*,* . EOS
Apply the same fix as this PR https://github.com/WorksApplications/Sudachi/pull/118/ for Java implementation.
Related: #120
When there are more tokens than the original, due to the normalization, set the original to the first output token, not the last.
For example, currently,
This will be fixed to