Open xfq opened 8 months ago
Should CJK Compatibility Ideographs be abandoned?
There seem to be some standard Chinese characters in CJK Compatibility Ideographs. @eisoch?
U+3007 (〇) IDEOGRAPHIC NUMBER ZERO in CJK Symbols and Punctuation (U+3000..U+303F) is also considered a hanzi by standards, dictionaries and UCS according to 「〇」算不算汉字? - 知乎 (Is “〇” a hanzi? - Zhihu).
Additionally, outside the list @xfq provided above, there are some other characters with script property “Han” in UCD, such as U+3005 (々) IDEOGRAPHIC ITERATION MARK and Suzhou numerals (U+3021..U+3029). Should they be listed?
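As a rough way to inspect the characters mentioned so far, here is a minimal sketch using only Python's standard `unicodedata` module. Note that this module does not expose the Script property, so it cannot confirm "Han" by itself; it only prints each character's name and general category. The `CANDIDATES` list and the `describe` helper are illustrative names, not part of any existing tooling.

```python
import unicodedata

# Code points raised in this thread that sit outside the main ideograph blocks:
# U+3005, U+3007, and the Suzhou numerals U+3021..U+3029.
CANDIDATES = [0x3005, 0x3007] + list(range(0x3021, 0x302A))


def describe(cp):
    """Return (code point label, character name, general category)."""
    ch = chr(cp)
    return (f"U+{cp:04X}", unicodedata.name(ch), unicodedata.category(ch))


rows = [describe(cp) for cp in CANDIDATES]
for row in rows:
    print(*row)
```

The Suzhou numerals appear under the formal names HANGZHOU NUMERAL ONE to NINE, so a list built by searching character names for "ideograph" would miss them. That is one reason to agree on explicit inclusion criteria first.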
We should probably list the various character sets defined by each region too, like https://www.w3.org/TR/hani-lreq/#h_script_overview
You probably need to revisit this list now that Unicode 16.0 has been released.
Also, if the compatibility block is mentioned (which I suggest should be done separately, if at all), other Unicode blocks may need mentioning too, such as CJK radicals, CJK strokes, kanbun, etc. See the list at https://www.unicode.org/charts/ under East Asian Scripts.
It's probably best to define clearly what types of character should go in the list, and to do that we first need to be clear about why we're listing characters (i.e. who will use the list, and for what).
It might be useful to list the Chinese characters in Unicode, like klreq and alreq: