google / mozc

Mozc - a Japanese Input Method Editor designed for multi-platform
Other
2.32k stars 329 forks source link

Typing issue: 葛󠄀城市 (containing IVS; Nara Prefecture) #919

Open tats-u opened 2 months ago

tats-u commented 2 months ago

Category of the typing issue

Choose one of them (delete rest of them)

  1. Out-of-vocabulary

Issues

Write issues to the following table. (It's in the markdown format)

input [e.g.ゆうひ] expected [e.g. 夕日] actual [e.g. ユウヒ]
かつらぎし 葛󠄀城市,葛城市 葛城市
かつらぎ 葛󠄀城,葛城 葛城

Version or commit-id

[e.g. Mozc-2.28.4960.100+24.11.oss or d50a8b9ae28c4fba265f734b38bc5ae392fe4d25] You can get the version string by converting "Version" or "ばーじょん".

GoogleJapaneseInput-2.29.5374.100+24.11.9

Additional context

Add any other context about the problem here.

"葛󠄀城" is the intended form. The IVS U+E0100 is required when the current font conforms to JIS2004 and supports Adobe-Japan1.

https://www.city.katsuragi.nara.jp/shisei/shinoshokai/3700.html

葛城市 → 葛 城 市 葛󠄀城市 → 葛 U+E0100 城 市

rg "^かつらぎし?\t.*城市?$"
dictionary_oss\dictionary08.txt
20590:かつらぎ  1919    1919    6753    桂城

test\dictionary\dictionary.txt
25084:かつらぎ  2010    2010    4447    葛城
53647:かつらぎ  2010    2010    4447    葛城

dictionary_oss\dictionary02.txt
94083:かつらぎし        1920    2014    4870    葛城市

dictionary_oss\dictionary05.txt
63915:かつらぎ  1919    1919    4742    葛城
63916:かつらぎ  1920    1920    4695    葛城

This can be used for test cases for IVS.


This is an issue only for the proprietary Google IME issue, but the name of Nezuko Kamado, the sister of the main chracter of Demon Slayer: Kimetsu no Yaiba, has a similar problem.