Open eisoch opened 4 years ago
use for U+45A2 䖢
https://www.unicode.org/L2/L2016/16385r-uax45-add-ideo.pdf
use for U+2028E 𠊎
https://m.sohu.com/a/151136591_164554 https://www.sohu.com/a/122576553_505705 https://you.autohome.com.cn/details/111119
use for U+2CCB1 𬲱
http://www.aminoacid-jirong.com/news/9z2996qq9j3j3f9.html
use for U+2CCC7 𬳇
http://baoji.mofcom.gov.cn/article/dxsw/201410/20141000762978.shtml
use for U+2EA3B 𮨻
https://www.sohu.com/a/205854785_531962
report for U+310F1 𱃪
http://news.sina.com.cn/s/2005-03-22/03115425279s.shtml
use for U+310F1 𱃱
Some non-G0 or non-G1 common Chinese dialectal ideographs have been included in IICore or UnihanCore.
Two rules in GB/T 22484-2016, A.3.2 and A.3.9, cover the naming of metro and bus stations and are relevant to this list.
UCS | Char. | G Ref. | IICore | UHCore | TGH | Dialect | T/S | Note |
---|---|---|---|---|---|---|---|---|
U+3635 | 㘵 | G3-3370 | | H | | yue-CN-GD-CAN | TS | 清㘵: metro station; also used for place names in Fujian and Guangdong |
U+3E06 | 㸆 | GS-215F | | GH | 6411 | cmn-CN-SD-ZBO | TS | 干㸆鸡块, 煎㸆大虾: food; also used in DB37/T 1976-2011 |
U+4C7D | 䱽 | GHZ-74707.08 | CH | H | | yue-CN-HK | T | pomfret |
U+5526 | 唦 | G3-3643 | | HMT | | cmn-CN-HB-WUH | TS | 板唦: attitude |
U+5710 | 圐 | G3-3759 | | G | 7369 | cjy-CN-NM-ODS-JUN | TS | 糕圐圙: food; also used in DB15/T 815-2014 |
U+5719 | 圙 | G3-375C | | G | 7677 | cjy-CN-NM-ODS-JUN | TS | 糕圐圙: food; also used in DB15/T 815-2014 |
U+5871 | 塱 | G5-365B | BTH | GHMT | 7601 | yue-CN-GD-CAN | TS | 西塱: metro station |
U+5C7B | 屻 | G5-3A5C | | HMT | | hak-CN-HK | TS | legacy use for 打铁岃 as a bus station name |
U+5C83 | 岃 | GH-1105 | | H | | hak-CN-GD-MXZ; hak-CN-HK | TS | 赤岃岗: bus station; 打铁岃: bus station |
U+6725 | 朥 | GE-3C3E | | H | | nan-CN-GD-SWA | T | 朥餅: food; also used in T/CZSPTXH 075-2018 |
U+6818 | 栘 | G3-413C | AT | HMT | | cmn-CN-YN-KMG | TS | 酸栘𣐿: tree |
U+681F | 栟 | G3-4140 | | GHMT | 4461 | wuu-CN-JS-RDG | TS | 栟茶: train station |
U+6C39 | 氹 | G5-5460 | BHM | HM | | yue-CN-MO | TS | 氹仔: LRT line |
U+6D50 | 浐 | G8-7C78 | | G | 6895 | cmn-CN-SN-SIA | S | 浐河: metro station |
U+6E74 | 湴 | G3-546F | | GHMT | 7239 | yue-CN-GD-CAN | TS | 长湴: metro station |
U+6E81 | 溁 | G8-2E5D | | G | 7427 | hsn-CN-HN-CSX | S | 溁湾镇: metro station |
U+6ED8 | 滘 | G3-5563 | BGH | GHMT | 7596 | yue-CN-GD-LWQ; yue-CN-GD-SUD; yue-CN-GD-DGG | TS | 滘口: metro station; 北滘: train station; 道滘: train station |
U+6EFB | 滻 | G3-542A | | HMT | | cmn-CN-SN-SIA | T | 滻河: metro station |
U+6FDA | 濚 | G5-5628 | AKP | HKP | | hsn-CN-HN-CSX | T | 濚灣鎮: metro station |
U+713F | 焿 | GE-4079 | CT | T | | nan-CN-TW-KEE | TS | 红烧鳗焿 & 豆签焿: food |
U+7201 | 爁 | G3-503D | | HMT | | cmn-CN-SN-SIA | T | 爁臊子: food |
U+784B | 硋 | G5-582D | | H | | cdo-CN-FJ-NDS | TS | ceramic |
U+7881 | 碁 | GE-443D | ATJKP | HJKP | | yue-CN-HK | V | 石碁: metro station |
U+78B6 | 碶 | G3-583D | | GH | 7659 | wuu-CN-ZJ-NGB | TS | 大碶: metro station |
U+7910 | 礐 | G3-5767 | | HMT | | nan-CN-GD-SWA | T | 礐石風景區: bus station |
U+7C84 | 粄 | G5-6658 | CT | HMT | | hak-CN-GD-MXZ | TS | food; also used in T/GDMZCX 031-2019, 2.22 in T/GDMZCX 001-2019 |
U+7CBF | 粿 | G5-666C | CT | GHMT | 5792 | nan-CN-GD-SWA | TS | food; also used in T/CZSPTXH 001-2018.1, T/CZSPTXH 004-2018, T/CZSPTXH 070-2018, T/CZSPTXH 074-2018, T/CZSPTXH 077-2018, T/CZSPTXH 079-2019, T/CZSPTXH 085-2019, T/CZSPTXH 98.1-2019, T/CZSPTXH 101.1-2019, T/CZSPTXH 104.1-2019, T/YWYCY 3-2019 |
U+7F49 | 罉 | GH-1278 | | H | | yue-CN-HK | TS | 甜薄罉: dessert, how to make it |
U+8137 | 脷 | GH-1281 | BHM | HM | | yue-CN-HK | TS | 龙脷鱼: sole fish |
U+81A5 | 膥 | GH-1110 | CH | H | | yue-CN-HK | TS | 多膥鱼: capelin |
U+83C9 | 菉 | GE-495D | AGTKP | GHKMPT | 7133 | yue-CN-GD-ZHA; wuu-CN-JS-KUS | TS | 菉塘: bus station, 菉溪路联谊路: bus station |
U+8855 | 衕 | GE-4B77 | | HMT | | cmn-CN-BJ | V | 衚衕: alley in the common translation, possibly derived from ᠬᠤᠳᠳᠤᠭ or ᠬᠣᠲᠠ |
U+885A | 衚 | GE-4B7A | | HMT | | cmn-CN-BJ | V | 衚衕: alley in the common translation, possibly derived from ᠬᠤᠳᠳᠤᠭ or ᠬᠣᠲᠠ |
U+8FCC | 迌 | GE-4F2F | | H | | nan-CN-TW-TPE | TS | 《𨑨迌》 |
U+9120 | 鄠 | G3-7137 | | GHMT | 7510 | cmn-CN-SN-HUX | TS | 鄠邑: train station |
U+98E5 | 飥 | G3-7A4A | | HMT | | cmn-CN-SN-YNA | T | 䬣飥: food |
U+990E | 餎 | G3-7A5E | | H | | cmn-CN-HA-JXY; cmn-CN-HE-NJN | T | 餄餎麺: food |
U+991C | 餜 | G3-7A6E | | H | | cmn-CN-TJ | T | 煎餅餜子: food |
U+992C | 餬 | GE-545C | BTH | HJMT | | cmn-CN-SN-WNA | T | 餬餑: food |
U+993D | 餽 | GE-5462 | ATH | HJMT | | cmn-CN-HE-ZJK; cmn-CN-BJ-YQX | T | 炒餽⿰飠畾 |
U+9978 | 饸 | G8-2D45 | CG | G | 4300 | cmn-CN-HA-JXY; cmn-CN-HE-NJN | S | 饸饹面: food; also used in DB15/T 794-2014, DB41/T 1620-2018, T/JCCZ 001-2019, T/WRSPCYHY 002-2020, T/JCCZ 103-2020 |
U+9979 | 饹 | G8-2D46 | CG | G | 4301 | cmn-CN-HA-JXY; cmn-CN-HE-NJN | S | 饸饹面: food; also used in DB15/T 794-2014, DB41/T 1620-2018, T/JCCZ 001-2019, T/WRSPCYHY 002-2020, T/JCCZ 103-2020 |
U+9983 | 馃 | G8-2D4A | CG | G | 4932 | cmn-CN-TJ | S | 煎饼馃子: food; also used in T/TJCY 002-2018 |
U+9B93 | 鮓 | G3-775C | CJ | HJMT | | cmn-CN-GZ-KWE; cmn-CN-GZ-PXN; cmn-CN-CQ | T | 小米鮓: a kind of food in Guiyang; 鮓面粑: a kind of food in Panzhou; 鮓鴨肉: a kind of food in Chongqing |
U+9BB0 | 鮰 | G3-7773 | | H | | wuu-CN-JS-NTG | T | 鮰魚: fish; also used in GB/T 15805.4-2018, T/NTCYHY 002.4-2018 |
U+9C02 | 鰂 | G3-7774 | CH | H | | yue-CN-HK | T | 鰂魚涌/Quarry Bay: metro station, 鰂魚: crucian |
U+9C72 | 鱲 | G3-7879 | BTH | H | | yue-CN-HK | T | 赤鱲角消防局: bus station |
U+9C8A | 鲊 | G8-2D31 | | G | 7558 | cmn-CN-GZ-KWE; cmn-CN-GZ-PXN; cmn-CN-CQ; cmn-CN-SC-RHQ | S | 小米鲊: a kind of food in Guiyang; 鲊面粑: a kind of food in Panzhou; 鲊鸭肉: a kind of food in Chongqing; also used in DB50/T 792-2017, the standard text is here; also used in T/GZSX 006-2016 and T/GZSX 024-2017; 拉鲊: train station |
U+9C97 | 鲗 | G8-2D34 | | G | 7703 | yue-CN-HK | S | 鲗鱼涌/Quarry Bay: metro station, 鲗鱼: crucian |
U+9C98 | 鲘 | G8-2D35 | | G | 7704 | nan-CN-GD-HIF | S | 鲘门: train station |
U+20547 | 𠕇 | | | H | | nan-CN-GD-SWA | TS | 𠕇鱼: fish |
U+20C96 | 𠲖 | GHZ-10622.01 | CH | H | | yue-CN-GD-ZHA | TS | 𠲖记鸡饭店: bus station |
U+2343F | 𣐿 | GKX-0526.14 | | H | | cmn-CN-YN-KMG | TS | 酸栘𣐿: tree |
U+241B5 | 𤆵 | GHZ-32194.05 | | H | | cmn-CN-YN-WSY; cmn-CN-CQ-SZY | TS | 𤆵肉饵丝: food, 桃园𤆵牛肉: food (people living in Shizhu County sometimes write it with the wrong character, as "桃园耙牛肉"); also used in T/YCX 0106-2018 |
U+266E8 | 𦛨 | | | H | | nan-CN-GD-SWA | S | 𦛨饼: food |
U+28468 | 𨑨 | GKX-1253.39 | | H | | nan-CN-TW-TPE | TS | 《𨑨迌》 |
U+2B6AD | 𫚭 | GCH-4027.31 | | G | 8100 | yue-CN-HK | S | 赤𫚭角消防局: bus station |
U+2C488 | 𬒈 | GCH-1010.22 | | G | 7070 | nan-CN-GD-SWA | S | 𬒈石风景区: bus station |
U+2C494 | 𬒔 | GGFZ-011300 | | G | 7506 | yue-CN-GD-FOS | TS | 石𬒔总站: bus station |
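Since the UCS and Char. columns of the table above are entered by hand, a small check can catch rows where the two disagree. The following is only a sketch: the helper names `ucs_to_char` and `row_is_consistent` are hypothetical and not part of any tool mentioned in this issue.

```python
def ucs_to_char(ucs: str) -> str:
    """Convert a 'U+XXXX' notation string into the character it names."""
    text = ucs.strip()
    if not text.upper().startswith("U+"):
        raise ValueError(f"not a U+ notation: {ucs!r}")
    # The hexadecimal digits after 'U+' are the code point.
    return chr(int(text[2:], 16))


def row_is_consistent(ucs: str, char: str) -> bool:
    """Return True when the Char. cell matches the code point in the UCS cell."""
    return ucs_to_char(ucs) == char.strip()


if __name__ == "__main__":
    # Two rows taken from the table above (UCS cell, Char. cell).
    for ucs, char in [("U+6C39", "氹"), ("U+7CBF", "粿")]:
        print(ucs, char, row_is_consistent(ucs, char))
```

Running the same check over every row would flag any pair whose code point and character do not match.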
Characters whose encoding is in progress are shown below.
WSSN | Pre-UCS | IDS | Ref. | Dialect | Popularity | T/S | Wr. | Note |
---|---|---|---|---|---|---|---|---|
WS2017-02313 | | ⿰火足 | GZA-204390 | nan-CN-GD-SWA | local | TS | 焯 | ~肉: food |
WS2017-03377 | | ⿰米某 | UTC-02979 | nan-CN-GD-SWE | local | TS | | porridge |
WS2017-03891 | | ⿰虫崩 | UTC-02975 | yue-CN-GD-SUD | Pearl River Delta and Hong Kong, Macao | TS | 蹦 | ~䖢: food |
WS2017-04751 | | ⿰饣召 | UK-10440 | cmn-CN-CQ | local | S | 哨 | ~子铺盖面: food |
UNC2020-1 | U+9FFD | ⿰口窄 | MC-00134 | yue-CN-HK; yue-CN-MO | local | TS | | 渣~/喳~: dessert |
The following characters need to be encoded in the future.
IDS | Dialect | Popularity | T/S | Wr. | Note | Evidence |
---|---|---|---|---|---|---|
⿰火掌 | yue-CN-GD-ZHA | local | TS | | 瓦~焗鸡: chicken baked in an earthen pot | absent |
⿰米困 | hak-CN-GD-FES | local | TS | | ~粄: food | absent |
⿺辶喜 | yue-CN-HK; yue-CN-MO | HK and Macao | TS | | attitude | 《俗》 |
⿰飠荅 | cmn-CN-SD-LCH | local | T | | 𩛃~: food | absent |
⿰飠畾 | cmn-CN-HE-ZJK; cmn-CN-BJ-YQX | local | T | 儡 | 餽~: food | absent |
⿰飠旋 | cmn-CN-SD-TNA | local | T | 旋 | 油~兒: food | absent |
⿰饣夬 | cmn-CN-YN-KMG | local | S | 块 | 饵~: food; DB5223/T 6-2019 uses the wrong character | 《烧饵~》 |
⿰饣瓜 | cmn-CN-SD-LCH | local | S | | ~⿰饣荅 or ~⿰饣答: food | absent |
⿰饣荅 | cmn-CN-SD-LCH | local | S | | ⿰饣瓜~: food | absent |
⿰饣鬼 | cmn-CN-HE-ZJK; cmn-CN-BJ-YQX | local | S | 傀 | ~⿰饣畾: food | absent |
⿰饣答 | cmn-CN-SD-LCH | local | S | | ⿰饣瓜~: food | absent |
⿰饣畾 | cmn-CN-HE-ZJK; cmn-CN-BJ-YQX | local | S | 儡 | ⿰饣鬼~: food | absent |
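The IDS column above uses Ideographic Description Sequences, which are written in prefix notation: an operator such as ⿰ or ⿺ is followed by its components. As a rough illustration, the sketch below parses such a sequence into a nested tuple. It assumes only the classic operators U+2FF0 to U+2FFB (⿲ and ⿳ take three operands, the others two), and the function name `parse_ids` is hypothetical.

```python
# Classic IDS operators: U+2FF2 and U+2FF3 are ternary, the rest are binary.
TERNARY = {"\u2ff2", "\u2ff3"}
BINARY = {chr(cp) for cp in range(0x2FF0, 0x2FFC)} - TERNARY


def parse_ids(ids: str):
    """Parse a prefix-notation IDS into nested tuples (operator, child, ...)."""

    def parse(pos: int):
        if pos >= len(ids):
            raise ValueError("IDS ends before all operands are given")
        ch = ids[pos]
        if ch in BINARY or ch in TERNARY:
            arity = 3 if ch in TERNARY else 2
            children = []
            pos += 1
            for _ in range(arity):
                child, pos = parse(pos)
                children.append(child)
            return (ch, *children), pos
        # Any other character is treated as a leaf component.
        return ch, pos + 1

    tree, end = parse(0)
    if end != len(ids):
        raise ValueError("unexpected characters after a complete IDS")
    return tree


if __name__ == "__main__":
    print(parse_ids("⿰火足"))
    print(parse_ids("⿺辶喜"))
```

A parser like this can reject malformed entries (for example, an operator with a missing component) before they are added to the table.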
Below I provide provisional (unstable) codes for the sub-languages or sub-dialects under the tag cmn and for some newly identified dialects in Guangdong and Hunan. They are not yet suitable for submission to ISO 639-3/RA.
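Code | Definition | Macrolanguage | Deprecated |
---|---|---|---|
dsm | 东北官话 (Northeastern Mandarin) | zho : cmn | |
cbm | 北京官话 (Beijing Mandarin) | zho : cmn | Yepocapa Southwestern Cakchiquel; merged with Kaqchikel |
hsm | 冀鲁官话 (Jilu Mandarin) | zho : cmn | |
jlm | 胶辽官话 (Jiaoliao Mandarin) | zho : cmn | |
cpm | 中原官话 (Central Plains Mandarin) | zho : cmn | |
lym | 兰银官话 (Lanyin Mandarin) | zho : cmn | |
jhm | 江淮官话 (Jianghuai Mandarin) | zho : cmn | |
cxm | 西南官话 (Southwestern Mandarin) | zho : cmn | |
cxn | 湘南土话 (Xiangnan Tuhua) | zho | |
szt | 韶州土话 (Shaozhou Tuhua) | zho | |
cxx | 湘西乡话 (Xiangxi Xianghua) | zho | |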
This list is intended to include encoded ideographs that are used to record common words in various Chinese dialects; the characters should at least be common and familiar to local people. The characters in this list are outside IICore and UnihanCore, so the list can be treated as a supplement to IICore and UnihanCore. For example, “唦” is not a G0 character, and it is used in the Wuhan dialect word “板唦” by Dada Band (达达乐队); however, it is already included in UnihanCore, where it is marked HMT, so there is no need to include it here. Please click here to see the explanation by Dada Band.
Some people substitute a wrong character for the correct one because the correct one is hard to input, even though they know how to write it. Some of these wrong characters have become increasingly popular.
The dialect column is based on ISO 639-3:2007, GB/T 2260-2007, ISO 3166-1:2020, ISO 3166-2:CN, ISO 3166-2:TW, ISO 3166-2:HK, ISO 3166-2:MO, ISO 3166-2:JP, ISO 3166-2:KR, ISO 3166-2:KP, ISO 3166-2:VN, ISO 3166-2:SG and ISO 3166-2:MY; more than one value may be given where applicable.
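To make the dialect column easier to process, a cell such as `yue-CN-GD-ZHA; wuu-CN-JS-KUS` can be split into its parts. The sketch below assumes, based on the values in the tables, that the first subtag is the ISO 639-3 language code, the second is the ISO 3166-1 country code, and any remaining subtags are region codes from ISO 3166-2 or GB/T 2260, outermost first. That layout is an inference from the data rather than a stated rule, and the names `DialectTag` and `parse_dialect_cell` are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class DialectTag(NamedTuple):
    language: str                  # assumed ISO 639-3 code, e.g. "yue"
    country: str                   # assumed ISO 3166-1 alpha-2 code, e.g. "CN"
    subdivisions: Tuple[str, ...]  # remaining region codes, outermost first


def parse_dialect_cell(cell: str) -> List[DialectTag]:
    """Split a dialect cell on ';' and break each value into its subtags."""
    tags = []
    for raw in cell.split(";"):
        raw = raw.strip()
        if not raw:
            continue
        parts = raw.split("-")
        if len(parts) < 2:
            raise ValueError(f"expected at least language-country: {raw!r}")
        tags.append(DialectTag(parts[0], parts[1], tuple(parts[2:])))
    return tags


if __name__ == "__main__":
    for tag in parse_dialect_cell("yue-CN-GD-ZHA; wuu-CN-JS-KUS"):
        print(tag)
```

Keeping the subdivisions as a tuple allows values of different depth, such as `cmn-CN-BJ` and `cjy-CN-NM-ODS-JUN`, to be handled the same way.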