Unihandecode test case fails

miurahr / pykakasi

Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman.

https://codeberg.org/miurahr/pykakasi

GNU General Public License v3.0

421 stars 54 forks source link

Closed miurahr closed 3 years ago

miurahr commented 3 years ago

Describe the bug Unihandecode uses pykakasi. When move to v2.0 API, it become fails the test.

To Reproduce Test case "バニーちゃんちのシャワーノズルの先端"

should separate to

"バニー" "ちゃんちの" "シャワーノズル" " の" "先端"

but it become

"バニーちゃんちのシャワーノズルの", "先端"

miurahr commented 3 years ago

It is because v2.0 API degraded not to detect character type split between hiragana and katakana.