google / mozc

Mozc - a Japanese Input Method Editor designed for multi-platform
Other
2.32k stars 329 forks source link

Typing issue: 𩸽 (ほっけ) #918

Closed tats-u closed 2 months ago

tats-u commented 2 months ago

Category of the typing issue

Choose one of them (delete rest of them)

  1. Out-of-vocabulary

Issues

Write issues to the following table. (It's in the markdown format)

input [e.g.ゆうひ] expected [e.g. 夕日] actual [e.g. ユウヒ]
ほっけ 𩸽 ほっけ,ホッケ,法華,法花,法身

Version or commit-id

[e.g. Mozc-2.28.4960.100+24.11.oss or d50a8b9ae28c4fba265f734b38bc5ae392fe4d25] You can get the version string by converting "Version" or "ばーじょん".

GoogleJapaneseInput-2.29.5374.100+24.11.9

Additional context

Add any other context about the problem here.

Commonly used as test cases for kanjis out of BMP:

𠮷野家で 𩸽頼んで 𠮟られる

rg "^ほっけ\t"
dictionary_oss\dictionary08.txt
1377:ほっけ     1847    1847    6174    ほっけ
69370:ほっけ    1919    1919    7208    法花
69371:ほっけ    1919    1919    7163    法身

dictionary_oss\dictionary07.txt
3164:ほっけ     1847    1847    5896    ホッケ
3165:ほっけ     1920    1920    7691    法花
3166:ほっけ     1847    1847    6479    法華
3167:ほっけ     1919    1919    5375    法華
3168:ほっけ     1920    1920    6581    法華
hiroyuki-komatsu commented 2 months ago

Thank you for your report. 3c10b49fcf43db32f578acb49a45cfb4fe5e1cff has added the word

tats-u commented 1 month ago

I downloaded an installer from https://github.com/google/mozc/actions/runs/9058758344 and installed it in Windows Sandbox, but 𩸽 didn't appear...

image image

Version: Mozc-2.30.5448.100+24.11.oss

hiroyuki-komatsu commented 1 month ago

Ah that's true. In short, the Windows version will support 𩸽 in the next full dictionary update.

𩸽 is added to aux_dictionary.txt that is used for Bazel builds, but not GYP builds. Since the Windows build uses GYP, 𩸽 is not added to the dictionary. The next full dictionary update will contain 𩸽.

We will migrate the Windows build from GYP to Bazel in future. Thanks,

tats-u commented 1 month ago

I see. I'll give up for that.