Closed oliviazhu closed 9 years ago
@dougfelt, @xiangyexiao : Is this still an issue? At least, punctuation marks are now centered.
/cc @xiangyexiao /cc @fontguy
It's corrected now.
Strangely, "Cantonese" is listed under Noto Sans CJK SC but not Noto Sans CJK TC. I suspect the majority of written Cantonese users would be people in Hong Kong who may prefer Noto Sans CJK TC. Second, it'll be more preferred if "Cantonese" were written as "Yue Chinese" to be more precise and consistent with the the other Chinese languages / dialects. Cantonese would translate to the language spoken in Guangzhou / Gaungdong, but Yue Chinese is surely not use in use in Guangzhou nor is it the sole majority language of Guangdong. The appropriation of supported languages between SC and TC seem arbritary too. Maybe it makes sense to list everything except SC and TC under both?
We're using the CLDR data, so in a fundamental sense these are CLDR issues.
CLDR thinks yue is written only in Hans. I think we can probably patch our copy of CLDR to say both Hans and Hant, with Hant being more likely.
CLDR defines the English name of yue as 'Cantonese'. I think, at least in English, 'Cantonese' is more commonly used than 'Yue Chinese' for yue, despite being technically a misnomer as you say. We identify languages by iso 639-2/3 language code, and there's no language code expressly for Cantonese. Finally, although we use yue internally as the code, I expect the font has been looked at primarily in terms of support for Cantonese in particular. I'm not sure whether Cantonese has character or glyph requirements different from Yue as a whole, but if it does, I expect Cantonese has been the driver of these requirements that we managed to support (and I expect we don't support them all). So all in all I think 'Cantonese' is probably still a better term for us to use.
The main dialect of Yue Chinese is indeed usually referred as Cantonese, so it's not too a big deal. The characters required by written Cantonese sans written Standard Chinese are predominantly encoded in HKSCS - of which is the covering target of the TC version.
Patching to say both hant and hans seems a good solution. Apart from their howntown being in China, there's also a hefty hakka population in Hong Kong and a min population in Taiwan, both areas using Traditional Chinese. These other two languages would more accurately be listed under both. Lastly it takes to note that Wikipedia uses simplified Chinese for all Chinese languages except Yue and Literary Chinese by default - so that may suggest default preferences.
I'm closing this since the sample image problem has been fixed. I opened a separate bug for the language-script pairing issue.
What steps will reproduce the problem?
What is the expected output? What do you see instead? They look like SC variant, not TC variant.
I can confirm that the TC fonts were not used to create these specimens. The punctuation, which is not centered, is another clue.
Moved from googlei18n/noto-fonts#157