hfhchan / irg

stuff for IRG
22 stars 2 forks source link

Misunification of A00851-003 (T6-2168) and A02977-001 (GKX-0249.04) (U+215D3) #180

Open hfhchan opened 4 years ago

hfhchan commented 4 years ago

T6-2168 is a mis-unified form of two characters. Based on current CNS11643, the pronunciations are given as tài (variant of 太) and lì (variant of 立)

Glyph in code charts: image

The source GKX-0249.04 is the variant of "立".

In the MOE dictionary, the form ⿱大一 is given as variant of 立 (A02977-001) while the form ⿵大一 is given as variant of 太 (A00851-003). A00851-003 is currently using U+215D3

Therefore, there are two suggested treatment methods:

  1. Keep the shape ⿵大一 at T6-2168, preserve the pronunciation of tài; Separately code ⿱大一 with pronunciation lì; Dis-unify T6-2168 from U+215D3, and re-encode as a new character or IVS to 太.

or

  1. Change the shape of T6-2168 to ⿱大一 and keep the pronunciation of lì only; Separately code ⿵大一 with pronunciation tài; Keep T6-2168 at U+215D3. Re-encode the new character or IVS to 太.

and modify the MOE dictionary to use U+215D3 in A02977-001, and PUA / new char for A00851-003.

hfhchan commented 4 years ago

The shape is ⿱大一 encoded at 12-223B in CNS11643 with pronunciation lì: image https://www.cns11643.gov.tw/wordView.jsp?ID=795195

So Option (1) would be preferred.