LuteOrg / lute-v3

LUTE = Learning Using Texts: learn languages through reading.
https://luteorg.github.io/lute-manual/
MIT License
491 stars 46 forks source link

Japanese not handling repeat char correctly #488

Closed jzohrab closed 1 month ago

jzohrab commented 1 month ago

Sample current result:

image

For text:

聞こえる行く先々

新品 あたいのBody
Singing mi nah tell nuh lie
金銀 あたいの価値

Also spaces being removed for some reason.

jzohrab commented 1 month ago

Fixed in develop for the "repeat" character.

Mecab is stripping space characters ... I didn't want to fix this one just yet as I think it's rare, and have enough to do.

Singing mi nah tell nuh lie parsed yields

Singing 5   45
mi  5   38
nah 5   38
tell    5   38
nuh 5   38
lie 5   38
    0   4
EOP 3   7
jzohrab commented 1 month ago

In release 3.5.5.