w3c / i18n-actions

Action item tracker for the i18n WG.
1 stars 0 forks source link

make a list of invisible characters to support html 5121 discussion #73

Closed ghurlbot closed 4 months ago

ghurlbot commented 4 months ago

Opened by @aphillips via IRC channel #i18n on irc.w3.org

Due: 2024-02-22 (Thursday 22 February)

r12a commented 4 months ago

Priority items

For me, of those that are missing, these are the highest priority. I suggest possible named entities, derived from the standard Unicode abbreviations.

I'd also like to have &zwsp; in addition to ​for U+200B

Full list

It took a while to figure out how to come up with a reasonable list. The following is from From https://util.unicode.org/UnicodeJsps/list-unicodeset.jsp?a=%5B%3Adi%3A%5D%5B%3Awhite_space%3A%5D-%5B%3ACn%3A%5D&g=&i= but with some items manually excised because i felt they were not necessary. I marked in bold the ones for which we already have named entities. Ones i'm not sure about are in italics.

Latin 1 Supplement — Latin-1 punctuation and symbols

Combining Diacritical Marks — Grapheme joiner

Arabic — Format character

Hangul Jamo — Old initial consonants

Hangul Jamo — Medial vowels

Ogham — Space

Mongolian — Format controls

General Punctuation — Spaces

General Punctuation — Format character

General Punctuation — Separators

General Punctuation — Space

General Punctuation — Invisible operators

CJK Symbols And Punctuation — CJK symbols and punctuation

Hangul Compatibility Jamo — Special character

Halfwidth And Fullwidth Forms — Halfwidth Hangul variants

Shorthand Format Controls — Shorthand format controls

Musical Symbols — Beams and slurs

Emoji Variation Selectors - turns on and off colour

ghurlbot commented 4 months ago

Closed by @aphillips via IRC channel #i18n on irc.w3.org