w3c / i18n-glossary

Definitions of terms used in W3C Internationalization documents.
https://w3c.github.io/i18n-glossary/

Legacy character encodings #60

Closed: xfq closed this issue 7 months ago

xfq commented 7 months ago

https://www.w3.org/TR/i18n-glossary/#dfn-legacy-character-encodings

Legacy character encodings. Character encoding forms that do not encode the full repertoire of characters in the Unicode character set.

It seems that UTF-32 and GB 18030 don't count as legacy character encodings according to this definition. Is it intentional?

aphillips commented 7 months ago

UTF-32 certainly is not a legacy character encoding, since it is, in fact, a Unicode encoding defined by Unicode.

GB 18030 is an interesting case: it does encode the full repertoire, but it is not a Unicode encoding defined by Unicode. I think I'm fine with the ambiguity in that case?
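
For readers who want to see the distinction concretely, here is a rough sketch (not part of the glossary) that checks whether a few encodings can round-trip a small sample of characters, using the codecs bundled with Python. The `SAMPLE` list and the `can_encode_all` helper are hypothetical names chosen for illustration, and a handful of characters is only a spot check, not a test of the full Unicode repertoire.

```python
# Hypothetical spot-check sample: ASCII, Latin-1, a CJK ideograph,
# and two supplementary-plane characters. Not exhaustive.
SAMPLE = ["A", "\u00e9", "\u4e2d", "\U0001F600", "\U00020000"]


def can_encode_all(encoding: str, chars=SAMPLE) -> bool:
    """Return True if every sample character round-trips through `encoding`."""
    for ch in chars:
        try:
            if ch.encode(encoding).decode(encoding) != ch:
                return False
        except UnicodeError:
            # Raised when the codec has no mapping for the character.
            return False
    return True


# Codec names are those registered in Python's standard library.
results = {
    name: can_encode_all(name)
    for name in ("utf-32", "gb18030", "shift_jis", "latin-1")
}

for name, ok in results.items():
    print(f"{name}: {'covers sample' if ok else 'cannot encode some sample characters'}")
```

A check like this only measures repertoire coverage. It cannot tell whether an encoding is "defined by Unicode", which is the part of the question that leaves GB 18030 ambiguous.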

xfq commented 7 months ago

OK. Closing this. Thank you!