notofonts / noto-cjk

Noto CJK fonts
http://www.google.com/get/noto/help/cjk
2.94k stars 215 forks source link

Orthographic errors in Noto Sans CJK Simplified Chinese #273

Open mrk-z opened 3 months ago

mrk-z commented 3 months ago

We noticed some orthographic errors in the Simplified Chinese (SC) version of Noto Sans CJK.

Overall it seems that some Traditional Chinese (TC) or Japanese glyphs are included in the SC subset.

艹: In certain glyphs, the horizontal stroke separates into two. In SC, the horizontal stroke is usually a continuous stroke. The two horizontal strokes is usually happens in TC. Examples: 茠荁莁菗

Capture d’écran 2024-05-17 à 15 10 13

山: In certain glyphs, the bottom of left vertical stroke extends below the horizontal stroke. In SC, the bottom left corner is usually aligned. The extended left vertical stroke is usually seen in Japanese typefaces. Examples: 圸杣乢

Capture d’écran 2024-05-17 à 15 10 57

丰: In certain glyphs, the first horizontal stroke curves up. In SC, the first horizontal stroke is usually completely horizontal. The curved stroke is usually seen in TC. Examples: 玤垹

Capture d’écran 2024-05-17 à 15 06 29

女: In certain glyphs, the end of the first diagonal stroke extends over the beginning of the 丶 stroke In SC, the two strokes of 𡿨 is usually aligned. The extended first stroke is usually seen in Japanese. Examples: 嬶娯婲娿

Capture d’écran 2024-05-17 à 15 37 58

月: In certain glyphs, the middle strokes are diagonal. In SC, the middle strokes are usually horizontal. The diagonal strokes are usually seen in TC or Japanese Examples: 脪臐胣腖

Capture d’écran 2024-05-17 à 15 51 04

(for 胣, the 也 part also has a form usually found in TC.)

NightFurySL2001 commented 3 months ago

Note that the project only has a limited coverage outside of GB2312, GB12345 and 通用规范汉字表 for Simplified Chinese. Quote:

Since GB 18030 isn't a glyph standard you can't really say anything is not "GB compliant". I'm not opposed to having a note somewhere, but I wouldn't call it a known issue or an issue at all. It's just a statement of clarification that not all glyphs follow PRC representative glyph conventions.

Originally posted by @punchcutter in https://github.com/adobe-fonts/source-han-sans/issues/412#issuecomment-1728672647

Noted 3 characters are in 通用规范汉字表, which had been reported in https://github.com/adobe-fonts/source-han-sans/issues/313, https://github.com/adobe-fonts/source-han-sans/issues/324 and https://github.com/adobe-fonts/source-han-sans/issues/206#issuecomment-560241926

A few others are mentioned here:

These glyphs only have a TW form right now and no current plan to add a CN glyph without some compelling argument.

Originally posted by @punchcutter in https://github.com/adobe-fonts/source-han-serif/issues/200#issuecomment-1711146860

You can see the glyph sharing status with my tool: https://nightfurysl2001.github.io/shs-cid/ nightfurysl2001 github io_shs-cid_