latex3 / unicode-math

XeLaTeX/LuaLaTeX package for using unicode/OpenType maths fonts
http://ctan.org/pkg/unicode-math
LaTeX Project Public License v1.3c
239 stars 28 forks source link

Comparison with Unicode TR25 data #619

Open davidcarlisle opened 10 months ago

davidcarlisle commented 10 months ago

Description

unicode.xml at https://github.com/w3c/xml-entities/ has data for unicode-math and MathClass-15 from TR25

I have updated it to track updates here and also added a comparison of the math classes assigned, in the stylesheet comp-um.xsl in that repository.

The resulting log is included here. Some of the differences are intentional, but some are probably errors in MathClass-15 or unicode-math-table.

In the list below

*** denotes a character listed in MathClass-15 with no entry in unicode-math-table.

??? denotes a csname unicode.xml associates with unicode-math that is not in unicode-math-table (all now fixed)

! denotes an entry in unicode-math that is not in MathClass-15



#   N - Normal - includes all digits and symbols requiring only one form
#   A - Alphabetic
#   B - Binary
#   C - Closing - usually paired with opening delimiter
#   D - Diacritic
#   F - Fence - unpaired delimiter (often used as opening or closing)
#   G - Glyph_Part - piece of large operator
#   L - Large - n-ary or large operator, often takes limits
#   O - Opening - usually paired with closing delimiter
#   P - Punctuation
#   R - Relation - includes arrows
#   S - Space
#   U - Unary - operators that are only unary
#   V - Vary - operators that can be unary or binary depending on context
#   X - Special - characters not covered by other classes

------
               not(
              (@mathclass='R' and $um='\mathrel')
           or (@mathclass='B' and $um='\mathbin')
           or (@mathclass='V' and $um='\mathbin')
           or (@mathclass='A' and $um='\mathalpha')
           or (@mathclass='N' and $um='\mathord')
           or (@mathclass='G' and $um='\mathord')
           or (@mathclass='U' and $um='\mathord')
           or (@mathclass='O' and $um='\mathopen')
           or (@mathclass='C' and $um='\mathclose')
           or (@mathclass='P' and $um='\mathpunct')
           or (@mathclass='F' and $um='\mathfence')
           or (@mathclass='L' and $um='\mathop')
           or (@mathclass='D' and $um='\mathaccent')
           )

------

U00021 | \mathexclam                | N | \mathclose         | EXCLAMATION MARK
U0002E | \mathperiod                | P | \mathord           | FULL STOP
U0002F | \mathslash                 | B | \mathord           | SOLIDUS
U0003F | \mathquestion              | P | \mathord           | QUESTION MARK
U0005C | \backslash                 | B | \mathord           | REVERSE SOLIDUS
U000A0 | ***                        | S | ***                | NO-BREAK SPACE
U000A1 | ***                        | P | ***                | INVERTED EXCLAMATION MARK
U000A2 | ***                        | N | ***                | CENT SIGN
U000A4 | ***                        | N | ***                | CURRENCY SIGN
U000A6 | ***                        | N | ***                | BROKEN BAR
U000A7 | ***                        | N | ***                | SECTION SIGN
U000A8 | ***                        | D | ***                | DIAERESIS
U000AF | ***                        | D | ***                | MACRON
U000B0 | ***                        | N | ***                | DEGREE SIGN
U000B2 | ***                        | N | ***                | SUPERSCRIPT TWO
U000B3 | ***                        | N | ***                | SUPERSCRIPT THREE
U000B4 | ***                        | N | ***                | ACUTE ACCENT
U000B5 | ***                        | N | ***                | MICRO SIGN
U000B6 | ***                        | N | ***                | PILCROW SIGN
U000B9 | ***                        | N | ***                | SUPERSCRIPT ONE
U000BC | ***                        | N | ***                | VULGAR FRACTION ONE QUARTER
U000BD | ***                        | N | ***                | VULGAR FRACTION ONE HALF
U000BE | ***                        | N | ***                | VULGAR FRACTION THREE QUARTERS
U000BF | ***                        | P | ***                | INVERTED QUESTION MARK
U000F0 | \matheth                   | ! | \mathalpha         | LATIN SMALL LETTER ETH
U00131 | ***                        | A | ***                | LATIN SMALL LETTER DOTLESS I
U001B5 | \Zbar                      | ! | \mathord           | LATIN CAPITAL LETTER Z WITH STROKE
U00237 | ***                        | A | ***                | LATIN SMALL LETTER DOTLESS J
U002C6 | ***                        | D | ***                | MODIFIER LETTER CIRCUMFLEX ACCENT
U002C7 | ***                        | D | ***                | CARON
U002D8 | ***                        | D | ***                | BREVE
U002D9 | ***                        | D | ***                | DOT ABOVE
U002DA | ***                        | D | ***                | RING ABOVE
U002DC | ***                        | D | ***                | SMALL TILDE
U00309 | \ovhook                    | ! | \mathaccent        | COMBINING HOOK ABOVE
U00310 | \candra                    | ! | \mathaccent        | COMBINING CANDRABINDU
U00311 | ***                        | D | ***                | COMBINING INVERTED BREVE
U00312 | \oturnedcomma              | ! | \mathaccent        | COMBINING TURNED COMMA ABOVE
U00315 | \ocommatopright            | ! | \mathaccent        | COMBINING COMMA ABOVE RIGHT
U0031A | \droang                    | ! | \mathaccent        | COMBINING LEFT ANGLE ABOVE
U00323 | ***                        | D | ***                | COMBINING DOT BELOW
U0032C | ***                        | D | ***                | COMBINING CARON BELOW
U0032D | ***                        | D | ***                | COMBINING CIRCUMFLEX ACCENT BELOW
U0032E | ***                        | D | ***                | COMBINING BREVE BELOW
U0032F | ***                        | D | ***                | COMBINING INVERTED BREVE BELOW
U00330 | \wideutilde                | D | \mathbotaccentwide | COMBINING TILDE BELOW
U00331 | ***                        | D | ***                | COMBINING MACRON BELOW
U00333 | ***                        | D | ***                | COMBINING DOUBLE LOW LINE
U00338 | \notaccent                 | D | \mathaccentoverlay | COMBINING LONG SOLIDUS OVERLAY
U0033A | ***                        | D | ***                | COMBINING INVERTED BRIDGE BELOW
U0033F | ***                        | D | ***                | COMBINING DOUBLE OVERLINE
U00346 | ***                        | D | ***                | COMBINING BRIDGE ABOVE
U003A5 | \m?upUpsilon               | ! | \mathalpha         | GREEK CAPITAL LETTER UPSILON
U003C2 | \m?upvarsigma              | ! | \mathalpha         | GREEK SMALL LETTER FINAL SIGMA
U003D2 | ***                        | A | ***                | GREEK UPSILON WITH HOOK SYMBOL
U00428 | ***                        | A | ***                | CYRILLIC CAPITAL LETTER SHA
U00606 | ***                        | L | ***                | ARABIC-INDIC CUBE ROOT
U00607 | ***                        | L | ***                | ARABIC-INDIC FOURTH ROOT
U00608 | ***                        | A | ***                | ARABIC RAY
U00627 | ***                        | A | ***                | ARABIC LETTER ALEF
U00628 | ***                        | A | ***                | ARABIC LETTER BEH
U00629 | ***                        | A | ***                | ARABIC LETTER TEH MARBUTA
U0062A | ***                        | A | ***                | ARABIC LETTER TEH
U0062B | ***                        | A | ***                | ARABIC LETTER THEH
U0062C | ***                        | A | ***                | ARABIC LETTER JEEM
U0062D | ***                        | A | ***                | ARABIC LETTER HAH
U0062E | ***                        | A | ***                | ARABIC LETTER KHAH
U0062F | ***                        | A | ***                | ARABIC LETTER DAL
U00630 | ***                        | A | ***                | ARABIC LETTER THAL
U00631 | ***                        | A | ***                | ARABIC LETTER REH
U00632 | ***                        | A | ***                | ARABIC LETTER ZAIN
U00633 | ***                        | A | ***                | ARABIC LETTER SEEN
U00634 | ***                        | A | ***                | ARABIC LETTER SHEEN
U00635 | ***                        | A | ***                | ARABIC LETTER SAD
U00636 | ***                        | A | ***                | ARABIC LETTER DAD
U00637 | ***                        | A | ***                | ARABIC LETTER TAH
U00638 | ***                        | A | ***                | ARABIC LETTER ZAH
U00639 | ***                        | A | ***                | ARABIC LETTER AIN
U0063A | ***                        | A | ***                | ARABIC LETTER GHAIN
U02000 | ***                        | S | ***                | EN QUAD
U02001 | ***                        | S | ***                | EM QUAD
U02002 | ***                        | S | ***                | EN SPACE
U02003 | ***                        | S | ***                | EM SPACE
U02004 | ***                        | S | ***                | THREE-PER-EM SPACE
U02005 | ***                        | S | ***                | FOUR-PER-EM SPACE
U02006 | ***                        | S | ***                | SIX-PER-EM SPACE
U02007 | ***                        | S | ***                | FIGURE SPACE
U02009 | ***                        | S | ***                | THIN SPACE
U0200A | ***                        | S | ***                | HAIR SPACE
U0200B | ***                        | S | ***                | ZERO WIDTH SPACE
U02010 | ***                        | P | ***                | HYPHEN
U02012 | ***                        | P | ***                | FIGURE DASH
U02013 | ***                        | P | ***                | EN DASH
U02014 | ***                        | P | ***                | EM DASH
U02015 | \horizbar                  | ! | \mathord           | HORIZONTAL BAR
U02017 | \twolowline                | ! | \mathord           | DOUBLE LOW LINE
U02020 | \dagger                    | R | \mathbin           | DAGGER
U02021 | \ddagger                   | R | \mathbin           | DOUBLE DAGGER
U02025 | \enleadertwodots           | ! | \mathord           | TWO DOT LEADER
U02038 | \caretinsert               | ! | \mathord           | CARET
U0203B | ***                        | N | ***                | REFERENCE MARK
U02043 | \hyphenbullet              | ! | \mathord           | HYPHEN BULLET
U02047 | \Question                  | ! | \mathord           | DOUBLE QUESTION MARK
U0204E | ***                        | B | ***                | LOW ASTERISK
U0204F | ***                        | R | ***                | REVERSED SEMICOLON
U02051 | ***                        | N | ***                | TWO ASTERISKS ALIGNED VERTICALLY
U02052 | ***                        | N | ***                | COMMERCIAL MINUS SIGN
U0205F | ***                        | S | ***                | MEDIUM MATHEMATICAL SPACE
U02061 | ***                        | B | ***                | FUNCTION APPLICATION
U02062 | ***                        | B | ***                | INVISIBLE TIMES
U02063 | ***                        | P | ***                | INVISIBLE SEPARATOR
U02064 | ***                        | X | ***                | INVISIBLE PLUS
U0207A | ***                        | N | ***                | SUPERSCRIPT PLUS SIGN
U0207B | ***                        | N | ***                | SUPERSCRIPT MINUS
U0207C | ***                        | N | ***                | SUPERSCRIPT EQUALS SIGN
U0207D | ***                        | N | ***                | SUPERSCRIPT LEFT PARENTHESIS
U0207E | ***                        | N | ***                | SUPERSCRIPT RIGHT PARENTHESIS
U0208A | ***                        | N | ***                | SUBSCRIPT PLUS SIGN
U0208B | ***                        | N | ***                | SUBSCRIPT MINUS
U0208C | ***                        | N | ***                | SUBSCRIPT EQUALS SIGN
U0208D | ***                        | N | ***                | SUBSCRIPT LEFT PARENTHESIS
U0208E | ***                        | N | ***                | SUBSCRIPT RIGHT PARENTHESIS
U020AC | \euro                      | ! | \mathord           | EURO SIGN
U020D3 | ***                        | X | ***                | COMBINING SHORT VERTICAL LINE OVERLAY
U020D4 | ***                        | D | ***                | COMBINING ANTICLOCKWISE ARROW ABOVE
U020D5 | ***                        | D | ***                | COMBINING CLOCKWISE ARROW ABOVE
U020D6 | \overleftarrow             | D | \mathaccentwide    | COMBINING LEFT ARROW ABOVE
U020D8 | ***                        | D | ***                | COMBINING RING OVERLAY
U020D9 | ***                        | D | ***                | COMBINING CLOCKWISE RING OVERLAY
U020DA | ***                        | D | ***                | COMBINING ANTICLOCKWISE RING OVERLAY
U020DD | \enclosecircle             | D | \mathord           | COMBINING ENCLOSING CIRCLE
U020DE | \enclosesquare             | D | \mathord           | COMBINING ENCLOSING SQUARE
U020DF | \enclosediamond            | D | \mathord           | COMBINING ENCLOSING DIAMOND
U020E1 | \overleftrightarrow        | D | \mathaccentwide    | COMBINING LEFT RIGHT ARROW ABOVE
U020E4 | \enclosetriangle           | D | \mathord           | COMBINING ENCLOSING UPWARD POINTING TRIANGLE
U020E5 | ***                        | D | ***                | COMBINING REVERSE SOLIDUS OVERLAY
U020E6 | ***                        | D | ***                | COMBINING DOUBLE VERTICAL STROKE OVERLAY
U020E8 | \threeunderdot             | D | \mathbotaccent     | COMBINING TRIPLE UNDERDOT
U020EA | ***                        | D | ***                | COMBINING LEFTWARDS ARROW OVERLAY
U020EB | ***                        | D | ***                | COMBINING LONG DOUBLE SOLIDUS OVERLAY
U020EC | \underrightharpoondown     | D | \mathbotaccentwide | COMBINING RIGHTWARDS HARPOON WITH BARB DOWNWARDS
U020ED | \underleftharpoondown      | D | \mathbotaccentwide | COMBINING LEFTWARDS HARPOON WITH BARB DOWNWARDS
U020EE | \underleftarrow            | D | \mathbotaccentwide | COMBINING LEFT ARROW BELOW
U020EF | \underrightarrow           | D | \mathbotaccentwide | COMBINING RIGHT ARROW BELOW
U020F0 | \asteraccent               | ! | \mathaccent        | COMBINING ASTERISK ABOVE
U0210F | \hslash                    | N | \mathalpha         | PLANCK CONSTANT OVER TWO PI
U02126 | ***                        | N | ***                | OHM SIGN
U02129 | \m?turnediota              | N | \mathalpha         | TURNED GREEK SMALL LETTER IOTA
U0213C | \Bbbpi                     | A | \mathord           | DOUBLE-STRUCK SMALL PI
U0213E | \BbbGamma                  | N | \mathalpha         | DOUBLE-STRUCK CAPITAL GAMMA
U0214A | \PropertyLine              | ! | \mathord           | PROPERTY LINE
U0214B | \upand                     | N | \mathbin           | TURNED AMPERSAND
U021A8 | \updownarrowbar            | R | \mathord           | UP DOWN ARROW WITH BASE
U021B4 | \linefeed                  | ! | \mathord           | RIGHTWARDS ARROW WITH CORNER DOWNWARDS
U021B5 | \carriagereturn            | ! | \mathord           | DOWNWARDS ARROW WITH CORNER LEFTWARDS
U021B8 | \barovernorthwestarrow     | ! | \mathord           | NORTH WEST ARROW TO LONG BAR
U021B9 | \barleftarrowrightarrowbar | ! | \mathord           | LEFTWARDS ARROW TO BAR OVER RIGHTWARDS ARROW TO BAR
U021BA | \acwopencirclearrow        | R | \mathord           | ANTICLOCKWISE OPEN CIRCLE ARROW
U021BB | \cwopencirclearrow         | R | \mathord           | CLOCKWISE OPEN CIRCLE ARROW
U021DE | \nHuparrow                 | R | \mathord           | UPWARDS ARROW WITH DOUBLE STROKE
U021DF | \nHdownarrow               | R | \mathord           | DOWNWARDS ARROW WITH DOUBLE STROKE
U021E0 | \leftdasharrow             | R | \mathord           | LEFTWARDS DASHED ARROW
U021E1 | \updasharrow               | R | \mathord           | UPWARDS DASHED ARROW
U021E2 | \rightdasharrow            | R | \mathord           | RIGHTWARDS DASHED ARROW
U021E3 | \downdasharrow             | R | \mathord           | DOWNWARDS DASHED ARROW
U021E6 | \leftwhitearrow            | R | \mathord           | LEFTWARDS WHITE ARROW
U021E7 | \upwhitearrow              | R | \mathord           | UPWARDS WHITE ARROW
U021E8 | \rightwhitearrow           | R | \mathord           | RIGHTWARDS WHITE ARROW
U021E9 | \downwhitearrow            | R | \mathord           | DOWNWARDS WHITE ARROW
U021EA | \whitearrowupfrombar       | R | \mathord           | UPWARDS WHITE ARROW FROM BAR
U021EB | ***                        | R | ***                | UPWARDS WHITE ARROW ON PEDESTAL
U021EC | ***                        | R | ***                | UPWARDS WHITE ARROW ON PEDESTAL WITH HORIZONTAL BAR
U021ED | ***                        | R | ***                | UPWARDS WHITE ARROW ON PEDESTAL WITH VERTICAL BAR
U021EE | ***                        | R | ***                | UPWARDS WHITE DOUBLE ARROW
U021EF | ***                        | R | ***                | UPWARDS WHITE DOUBLE ARROW ON PEDESTAL
U021F0 | ***                        | R | ***                | RIGHTWARDS WHITE ARROW FROM WALL
U021F1 | ***                        | R | ***                | NORTH WEST ARROW TO CORNER
U021F2 | ***                        | R | ***                | SOUTH EAST ARROW TO CORNER
U021F3 | ***                        | R | ***                | UP DOWN WHITE ARROW
U02202 | \partial                   | N | \mathalpha         | PARTIAL DIFFERENTIAL
U02207 | \nabla                     | U | \mathalpha         | NABLA
U0221A | \sqrt                      | L | \mathopen          | SQUARE ROOT
U0221B | \cuberoot                  | L | \mathopen          | CUBE ROOT
U0221C | \fourthroot                | L | \mathopen          | FOURTH ROOT
U02234 | \therefore                 | R | \mathord           | THEREFORE
U02235 | \because                   | R | \mathord           | BECAUSE
U022A5 | \bot                       | R | \mathord           | UP TACK
U022B9 | \hermitmatrix              | B | \mathord           | HERMITIAN CONJUGATE MATRIX
U022EF | \unicodecdots              | R | \mathord           | MIDLINE HORIZONTAL ELLIPSIS
U02312 | \profline                  | ! | \mathord           | ARC
U02313 | \profsurf                  | ! | \mathord           | SEGMENT
U02317 | \viewdata                  | ! | \mathord           | VIEWDATA SQUARE
U0232C | \varhexagonlrbonds         | ! | \mathord           | BENZENE RING
U02332 | \conictaper                | ! | \mathord           | CONICAL TAPER
U02340 | \APLnotbackslash           | ! | \mathord           | APL FUNCTIONAL SYMBOL BACKSLASH BAR
U02353 | \APLboxupcaret             | ! | \mathord           | APL FUNCTIONAL SYMBOL QUAD UP CARET
U02370 | \APLboxquestion            | ! | \mathord           | APL FUNCTIONAL SYMBOL QUAD QUESTION
U0237C | \rangledownzigzagarrow     | R | \mathord           | RIGHT ANGLE WITH DOWNWARDS ZIGZAG ARROW
U023B0 | \lmoustache                | R | \mathopen          | UPPER LEFT OR LOWER RIGHT CURLY BRACKET SECTION
U023B1 | \rmoustache                | R | \mathclose         | UPPER RIGHT OR LOWER LEFT CURLY BRACKET SECTION
U023B4 | \overbracket               | N | \mathover          | TOP SQUARE BRACKET
U023B5 | \underbracket              | N | \mathunder         | BOTTOM SQUARE BRACKET
U023B8 | \lvboxline                 | ! | \mathord           | LEFT VERTICAL BOX LINE
U023B9 | \rvboxline                 | ! | \mathord           | RIGHT VERTICAL BOX LINE
U023CE | \varcarriagereturn         | ! | \mathord           | RETURN SYMBOL
U023D0 | ***                        | G | ***                | VERTICAL LINE EXTENSION
U023DC | \overparen                 | N | \mathover          | TOP PARENTHESIS
U023DD | \underparen                | N | \mathunder         | BOTTOM PARENTHESIS
U023DE | \overbrace                 | N | \mathover          | TOP CURLY BRACKET
U023DF | \underbrace                | N | \mathunder         | BOTTOM CURLY BRACKET
U02422 | \blanksymbol               | ! | \mathord           | BLANK SYMBOL
U02423 | \mathvisiblespace          | ! | \mathord           | OPEN BOX
U024C8 | ***                        | N | ***                | CIRCLED LATIN CAPITAL LETTER S
U02506 | \bdtriplevdash             | ! | \mathord           | BOX DRAWINGS LIGHT TRIPLE DASH VERTICAL
U02580 | \blockuphalf               | ! | \mathord           | UPPER HALF BLOCK
U02584 | \blocklowhalf              | ! | \mathord           | LOWER HALF BLOCK
U02588 | \blockfull                 | ! | \mathord           | FULL BLOCK
U0258C | \blocklefthalf             | ! | \mathord           | LEFT HALF BLOCK
U02590 | \blockrighthalf            | ! | \mathord           | RIGHT HALF BLOCK
U02591 | \blockqtrshaded            | ! | \mathord           | LIGHT SHADE
U02592 | \blockhalfshaded           | ! | \mathord           | MEDIUM SHADE
U02593 | \blockthreeqtrshaded       | ! | \mathord           | DARK SHADE
U025A2 | \squoval                   | ! | \mathord           | WHITE SQUARE WITH ROUNDED CORNERS
U025A3 | \blackinwhitesquare        | ! | \mathord           | WHITE SQUARE CONTAINING BLACK SMALL SQUARE
U025A4 | \squarehfill               | ! | \mathord           | SQUARE WITH HORIZONTAL FILL
U025A5 | \squarevfill               | ! | \mathord           | SQUARE WITH VERTICAL FILL
U025A6 | \squarehvfill              | ! | \mathord           | SQUARE WITH ORTHOGONAL CROSSHATCH FILL
U025A7 | \squarenwsefill            | ! | \mathord           | SQUARE WITH UPPER LEFT TO LOWER RIGHT FILL
U025A8 | \squareneswfill            | ! | \mathord           | SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL
U025A9 | \squarecrossfill           | ! | \mathord           | SQUARE WITH DIAGONAL CROSSHATCH FILL
U025AC | \hrectangleblack           | ! | \mathord           | BLACK RECTANGLE
U025B2 | \bigblacktriangleup        | B | \mathord           | BLACK UP-POINTING TRIANGLE
U025B4 | \blacktriangle             | B | \mathord           | BLACK UP-POINTING SMALL TRIANGLE
U025B5 | \vartriangle               | B | \mathrel           | WHITE UP-POINTING SMALL TRIANGLE
U025B6 | \blacktriangleright        | B | \mathord           | BLACK RIGHT-POINTING TRIANGLE
U025B8 | \smallblacktriangleright   | B | \mathord           | BLACK RIGHT-POINTING SMALL TRIANGLE
U025B9 | \smalltriangleright        | B | \mathord           | WHITE RIGHT-POINTING SMALL TRIANGLE
U025BA | \blackpointerright         | ! | \mathord           | BLACK RIGHT-POINTING POINTER
U025BB | \whitepointerright         | ! | \mathord           | WHITE RIGHT-POINTING POINTER
U025BC | \bigblacktriangledown      | B | \mathord           | BLACK DOWN-POINTING TRIANGLE
U025BD | \bigtriangledown           | B | \mathord           | WHITE DOWN-POINTING TRIANGLE
U025BE | \blacktriangledown         | B | \mathord           | BLACK DOWN-POINTING SMALL TRIANGLE
U025BF | \triangledown              | B | \mathord           | WHITE DOWN-POINTING SMALL TRIANGLE
U025C0 | \blacktriangleleft         | B | \mathord           | BLACK LEFT-POINTING TRIANGLE
U025C2 | \smallblacktriangleleft    | B | \mathord           | BLACK LEFT-POINTING SMALL TRIANGLE
U025C3 | \smalltriangleleft         | B | \mathord           | WHITE LEFT-POINTING SMALL TRIANGLE
U025C4 | \blackpointerleft          | B | \mathord           | BLACK LEFT-POINTING POINTER
U025C5 | \whitepointerleft          | B | \mathord           | WHITE LEFT-POINTING POINTER
U025CA | \mdlgwhtlozenge            | B | \mathord           | LOZENGE
U025CC | \dottedcircle              | ! | \mathord           | DOTTED CIRCLE
U025CD | \circlevertfill            | ! | \mathord           | CIRCLE WITH VERTICAL FILL
U025D4 | \circleurquadblack         | ! | \mathord           | CIRCLE WITH UPPER RIGHT QUADRANT BLACK
U025D5 | \blackcircleulquadwhite    | ! | \mathord           | CIRCLE WITH ALL BUT UPPER LEFT QUADRANT BLACK
U025D8 | \inversebullet             | ! | \mathord           | INVERSE BULLET
U025D9 | \inversewhitecircle        | ! | \mathord           | INVERSE WHITE CIRCLE
U025DA | \invwhiteupperhalfcircle   | ! | \mathord           | UPPER HALF INVERSE WHITE CIRCLE
U025DB | \invwhitelowerhalfcircle   | ! | \mathord           | LOWER HALF INVERSE WHITE CIRCLE
U025DC | \ularc                     | ! | \mathord           | UPPER LEFT QUADRANT CIRCULAR ARC
U025DD | \urarc                     | ! | \mathord           | UPPER RIGHT QUADRANT CIRCULAR ARC
U025DE | \lrarc                     | ! | \mathord           | LOWER RIGHT QUADRANT CIRCULAR ARC
U025DF | \llarc                     | ! | \mathord           | LOWER LEFT QUADRANT CIRCULAR ARC
U025E0 | \topsemicircle             | ! | \mathord           | UPPER HALF CIRCLE
U025E1 | \botsemicircle             | ! | \mathord           | LOWER HALF CIRCLE
U025E6 | \smwhtcircle               | B | \mathord           | WHITE BULLET
U025EC | \trianglecdot              | B | \mathord           | WHITE UP-POINTING TRIANGLE WITH DOT
U025ED | \triangleleftblack         | ! | \mathord           | UP-POINTING TRIANGLE WITH LEFT HALF BLACK
U025EE | \trianglerightblack        | ! | \mathord           | UP-POINTING TRIANGLE WITH RIGHT HALF BLACK
U025F0 | \squareulquad              | ! | \mathord           | WHITE SQUARE WITH UPPER LEFT QUADRANT
U025F1 | \squarellquad              | ! | \mathord           | WHITE SQUARE WITH LOWER LEFT QUADRANT
U025F2 | \squarelrquad              | ! | \mathord           | WHITE SQUARE WITH LOWER RIGHT QUADRANT
U025F3 | \squareurquad              | ! | \mathord           | WHITE SQUARE WITH UPPER RIGHT QUADRANT
U025F4 | \circleulquad              | ! | \mathord           | WHITE CIRCLE WITH UPPER LEFT QUADRANT
U025F5 | \circlellquad              | ! | \mathord           | WHITE CIRCLE WITH LOWER LEFT QUADRANT
U025F6 | \circlelrquad              | ! | \mathord           | WHITE CIRCLE WITH LOWER RIGHT QUADRANT
U025F7 | \circleurquad              | ! | \mathord           | WHITE CIRCLE WITH UPPER RIGHT QUADRANT
U025F8 | \ultriangle                | B | \mathord           | UPPER LEFT TRIANGLE
U025F9 | \urtriangle                | B | \mathord           | UPPER RIGHT TRIANGLE
U025FA | \lltriangle                | B | \mathord           | LOWER LEFT TRIANGLE
U025FB | \mdwhtsquare               | B | \mathord           | WHITE MEDIUM SQUARE
U025FC | \mdblksquare               | B | \mathord           | BLACK MEDIUM SQUARE
U025FD | \mdsmwhtsquare             | B | \mathord           | WHITE MEDIUM SMALL SQUARE
U025FE | \mdsmblksquare             | B | \mathord           | BLACK MEDIUM SMALL SQUARE
U025FF | \lrtriangle                | B | \mathord           | LOWER RIGHT TRIANGLE
U02605 | \bigstar                   | B | \mathord           | BLACK STAR
U02606 | \bigwhitestar              | B | \mathord           | WHITE STAR
U0260C | ***                        | N | ***                | CONJUNCTION
U02612 | ***                        | N | ***                | BALLOT BOX WITH X
U02621 | \danger                    | ! | \mathord           | CAUTION SIGN
U0263B | \blacksmiley               | ! | \mathord           | BLACK SMILING FACE
U0263C | \sun                       | ! | \mathord           | WHITE SUN WITH RAYS
U0263F | ***                        | N | ***                | MERCURY
U02641 | ***                        | N | ***                | EARTH
U02643 | ***                        | N | ***                | JUPITER
U02644 | ***                        | N | ***                | SATURN
U02646 | ***                        | N | ***                | NEPTUNE
U02647 | ***                        | N | ***                | PLUTO
U02648 | ***                        | N | ***                | ARIES
U02649 | ***                        | N | ***                | TAURUS
U0266A | \eighthnote                | ! | \mathord           | EIGHTH NOTE
U0266B | \twonotes                  | ! | \mathord           | BEAMED EIGHTH NOTES
U0267E | \acidfree                  | ! | \mathord           | PERMANENT PAPER SIGN
U026A5 | \Hermaphrodite             | ! | \mathord           | MALE AND FEMALE SIGN
U0273D | \dingasterisk              | ! | \mathord           | HEAVY TEARDROP-SPOKED ASTERISK
U0279B | \draftingarrow             | ! | \mathord           | DRAFTING POINT RIGHTWARDS ARROW
U027C3 | \subsetcirc                | R | \mathord           | OPEN SUBSET
U027C4 | \supsetcirc                | R | \mathord           | OPEN SUPERSET
U027C5 | \lbag                      | R | \mathopen          | LEFT S-SHAPED BAG DELIMITER
U027C6 | \rbag                      | R | \mathclose         | RIGHT S-SHAPED BAG DELIMITER
U027C7 | \veedot                    | R | \mathbin           | OR WITH DOT INSIDE
U027CA | ***                        | R | ***                | VERTICAL BAR WITH HORIZONTAL STROKE
U027CB | \diagup                    | R | \mathord           | MATHEMATICAL RISING DIAGONAL
U027CC | \longdivision              | L | \mathopen          | LONG DIVISION
U027CD | \diagdown                  | R | \mathord           | MATHEMATICAL FALLING DIAGONAL
U027CE | ***                        | B | ***                | SQUARED LOGICAL AND
U027CF | ***                        | B | ***                | SQUARED LOGICAL OR
U0292B | \rdiagovfdiag              | R | \mathord           | RISING DIAGONAL CROSSING FALLING DIAGONAL
U0292C | \fdiagovrdiag              | R | \mathord           | FALLING DIAGONAL CROSSING RISING DIAGONAL
U0292D | \seovnearrow               | R | \mathord           | SOUTH EAST ARROW CROSSING NORTH EAST ARROW
U0292E | \neovsearrow               | R | \mathord           | NORTH EAST ARROW CROSSING SOUTH EAST ARROW
U0292F | \fdiagovnearrow            | R | \mathord           | FALLING DIAGONAL CROSSING NORTH EAST ARROW
U02930 | \rdiagovsearrow            | R | \mathord           | RISING DIAGONAL CROSSING SOUTH EAST ARROW
U02931 | \neovnwarrow               | R | \mathord           | NORTH EAST ARROW CROSSING NORTH WEST ARROW
U02932 | \nwovnearrow               | R | \mathord           | NORTH WEST ARROW CROSSING NORTH EAST ARROW
U02934 | \uprightcurvearrow         | R | \mathord           | ARROW POINTING RIGHTWARDS THEN CURVING UPWARDS
U02935 | \downrightcurvedarrow      | R | \mathord           | ARROW POINTING RIGHTWARDS THEN CURVING DOWNWARDS
U02982 | \typecolon                 | F | \mathrel           | Z NOTATION TYPE COLON
U02999 | \fourvdots                 | F | \mathord           | DOTTED FENCE
U0299A | \vzigzag                   | F | \mathord           | VERTICAL ZIGZAG LINE
U029B5 | \circlehbar                | N | \mathbin           | CIRCLE WITH HORIZONTAL BAR
U029CD | \triangleserifs            | N | \mathbin           | TRIANGLE WITH SERIFS AT BOTTOM
U02B00 | ***                        | R | ***                | NORTH EAST WHITE ARROW
U02B01 | ***                        | R | ***                | NORTH WEST WHITE ARROW
U02B02 | ***                        | R | ***                | SOUTH EAST WHITE ARROW
U02B03 | ***                        | R | ***                | SOUTH WEST WHITE ARROW
U02B04 | ***                        | R | ***                | LEFT RIGHT WHITE ARROW
U02B05 | ***                        | R | ***                | LEFTWARDS BLACK ARROW
U02B06 | ***                        | R | ***                | UPWARDS BLACK ARROW
U02B07 | ***                        | R | ***                | DOWNWARDS BLACK ARROW
U02B08 | ***                        | R | ***                | NORTH EAST BLACK ARROW
U02B09 | ***                        | R | ***                | NORTH WEST BLACK ARROW
U02B0A | ***                        | R | ***                | SOUTH EAST BLACK ARROW
U02B0B | ***                        | R | ***                | SOUTH WEST BLACK ARROW
U02B0C | ***                        | R | ***                | LEFT RIGHT BLACK ARROW
U02B0D | ***                        | R | ***                | UP DOWN BLACK ARROW
U02B0E | ***                        | R | ***                | RIGHTWARDS ARROW WITH TIP DOWNWARDS
U02B0F | ***                        | R | ***                | RIGHTWARDS ARROW WITH TIP UPWARDS
U02B10 | ***                        | R | ***                | LEFTWARDS ARROW WITH TIP DOWNWARDS
U02B11 | ***                        | R | ***                | LEFTWARDS ARROW WITH TIP UPWARDS
U02B1A | \dottedsquare              | ! | \mathord           | DOTTED SQUARE
U02B1B | \lgblksquare               | ! | \mathord           | BLACK LARGE SQUARE
U02B1C | \lgwhtsquare               | ! | \mathord           | WHITE LARGE SQUARE
U02B1D | \vysmblksquare             | ! | \mathord           | BLACK VERY SMALL SQUARE
U02B1E | \vysmwhtsquare             | ! | \mathord           | WHITE VERY SMALL SQUARE
U02B1F | \pentagonblack             | ! | \mathord           | BLACK PENTAGON
U02B24 | \lgblkcircle               | ! | \mathord           | BLACK LARGE CIRCLE
U02B25 | \mdblkdiamond              | ! | \mathord           | BLACK MEDIUM DIAMOND
U02B26 | \mdwhtdiamond              | ! | \mathord           | WHITE MEDIUM DIAMOND
U02B27 | \mdblklozenge              | ! | \mathord           | BLACK MEDIUM LOZENGE
U02B28 | \mdwhtlozenge              | ! | \mathord           | WHITE MEDIUM LOZENGE
U02B29 | \smblkdiamond              | ! | \mathord           | BLACK SMALL DIAMOND
U02B2A | \smblklozenge              | ! | \mathord           | BLACK SMALL LOZENGE
U02B2B | \smwhtlozenge              | ! | \mathord           | WHITE SMALL LOZENGE
U02B2C | \blkhorzoval               | ! | \mathord           | BLACK HORIZONTAL ELLIPSE
U02B2D | \whthorzoval               | ! | \mathord           | WHITE HORIZONTAL ELLIPSE
U02B2E | \blkvertoval               | ! | \mathord           | BLACK VERTICAL ELLIPSE
U02B2F | \whtvertoval               | ! | \mathord           | WHITE VERTICAL ELLIPSE
U02B45 | \LLeftarrow                | ! | \mathrel           | LEFTWARDS QUADRUPLE ARROW
U02B46 | \RRightarrow               | ! | \mathrel           | RIGHTWARDS QUADRUPLE ARROW
U02B50 | \medwhitestar              | ! | \mathord           | WHITE MEDIUM STAR
U02B51 | \medblackstar              | ! | \mathord           | BLACK SMALL STAR
U02B52 | \smwhitestar               | ! | \mathord           | WHITE SMALL STAR
U02B53 | \rightpentagonblack        | ! | \mathord           | BLACK RIGHT-POINTING PENTAGON
U02B54 | \rightpentagon             | ! | \mathord           | WHITE RIGHT-POINTING PENTAGON
U03008 | ***                        | X | ***                | LEFT ANGLE BRACKET
U03009 | ***                        | X | ***                | RIGHT ANGLE BRACKET
U03012 | \postalmark                | ! | \mathord           | POSTAL MARK
U03014 | \lbrbrak                   | ! | \mathopen          | LEFT TORTOISE SHELL BRACKET
U03015 | \rbrbrak                   | ! | \mathclose         | RIGHT TORTOISE SHELL BRACKET
U03018 | \Lbrbrak                   | ! | \mathopen          | LEFT WHITE TORTOISE SHELL BRACKET
U03019 | \Rbrbrak                   | ! | \mathclose         | RIGHT WHITE TORTOISE SHELL BRACKET
U0301A | ***                        | X | ***                | LEFT WHITE SQUARE BRACKET
U0301B | ***                        | X | ***                | RIGHT WHITE SQUARE BRACKET
U03030 | \hzigzag                   | ! | \mathord           | WAVY DASH
U0306E | ***                        | N | ***                | HIRAGANA LETTER NO
U0FB29 | ***                        | X | ***                | HEBREW LETTER ALTERNATIVE PLUS SIGN
U0FE00 | ***                        | D | ***                | VARIATION SELECTOR-1
U0FE61 | ***                        | X | ***                | SMALL ASTERISK
U0FE62 | ***                        | X | ***                | SMALL PLUS SIGN
U0FE63 | ***                        | X | ***                | SMALL HYPHEN-MINUS
U0FE64 | ***                        | X | ***                | SMALL LESS-THAN SIGN
U0FE65 | ***                        | X | ***                | SMALL GREATER-THAN SIGN
U0FE66 | ***                        | X | ***                | SMALL EQUALS SIGN
U0FE68 | ***                        | X | ***                | SMALL REVERSE SOLIDUS
U0FF0B | ***                        | X | ***                | FULLWIDTH PLUS SIGN
U0FF1C | ***                        | X | ***                | FULLWIDTH LESS-THAN SIGN
U0FF1D | ***                        | X | ***                | FULLWIDTH EQUALS SIGN
U0FF1E | ***                        | X | ***                | FULLWIDTH GREATER-THAN SIGN
U0FF3C | ***                        | X | ***                | FULLWIDTH REVERSE SOLIDUS
U0FF3E | ***                        | X | ***                | FULLWIDTH CIRCUMFLEX ACCENT
U0FF5C | ***                        | X | ***                | FULLWIDTH VERTICAL LINE
U0FF5E | ***                        | X | ***                | FULLWIDTH TILDE
U0FFE2 | ***                        | X | ***                | FULLWIDTH NOT SIGN
U0FFE9 | ***                        | X | ***                | HALFWIDTH LEFTWARDS ARROW
U0FFEA | ***                        | X | ***                | HALFWIDTH UPWARDS ARROW
U0FFEB | ***                        | X | ***                | HALFWIDTH RIGHTWARDS ARROW
U0FFEC | ***                        | X | ***                | HALFWIDTH DOWNWARDS ARROW
U1EE00 | ***                        | A | ***                | ARABIC MATHEMATICAL ALEF
U1EE01 | ***                        | A | ***                | ARABIC MATHEMATICAL BEH
U1EE02 | ***                        | A | ***                | ARABIC MATHEMATICAL JEEM
U1EE03 | ***                        | A | ***                | ARABIC MATHEMATICAL DAL
U1EE05 | ***                        | A | ***                | ARABIC MATHEMATICAL WAW
U1EE06 | ***                        | A | ***                | ARABIC MATHEMATICAL ZAIN
U1EE07 | ***                        | A | ***                | ARABIC MATHEMATICAL HAH
U1EE08 | ***                        | A | ***                | ARABIC MATHEMATICAL TAH
U1EE09 | ***                        | A | ***                | ARABIC MATHEMATICAL YEH
U1EE0A | ***                        | A | ***                | ARABIC MATHEMATICAL KAF
U1EE0B | ***                        | A | ***                | ARABIC MATHEMATICAL LAM
U1EE0C | ***                        | A | ***                | ARABIC MATHEMATICAL MEEM
U1EE0D | ***                        | A | ***                | ARABIC MATHEMATICAL NOON
U1EE0E | ***                        | A | ***                | ARABIC MATHEMATICAL SEEN
U1EE0F | ***                        | A | ***                | ARABIC MATHEMATICAL AIN
U1EE10 | ***                        | A | ***                | ARABIC MATHEMATICAL FEH
U1EE11 | ***                        | A | ***                | ARABIC MATHEMATICAL SAD
U1EE12 | ***                        | A | ***                | ARABIC MATHEMATICAL QAF
U1EE13 | ***                        | A | ***                | ARABIC MATHEMATICAL REH
U1EE14 | ***                        | A | ***                | ARABIC MATHEMATICAL SHEEN
U1EE15 | ***                        | A | ***                | ARABIC MATHEMATICAL TEH
U1EE16 | ***                        | A | ***                | ARABIC MATHEMATICAL THEH
U1EE17 | ***                        | A | ***                | ARABIC MATHEMATICAL KHAH
U1EE18 | ***                        | A | ***                | ARABIC MATHEMATICAL THAL
U1EE19 | ***                        | A | ***                | ARABIC MATHEMATICAL DAD
U1EE1A | ***                        | A | ***                | ARABIC MATHEMATICAL ZAH
U1EE1B | ***                        | A | ***                | ARABIC MATHEMATICAL GHAIN
U1EE1C | ***                        | A | ***                | ARABIC MATHEMATICAL DOTLESS BEH
U1EE1D | ***                        | A | ***                | ARABIC MATHEMATICAL DOTLESS NOON
U1EE1E | ***                        | A | ***                | ARABIC MATHEMATICAL DOTLESS FEH
U1EE1F | ***                        | A | ***                | ARABIC MATHEMATICAL DOTLESS QAF
U1EE21 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL BEH
U1EE22 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL JEEM
U1EE24 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL HEH
U1EE27 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL HAH
U1EE29 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL YEH
U1EE2A | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL KAF
U1EE2B | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL LAM
U1EE2C | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL MEEM
U1EE2D | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL NOON
U1EE2E | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL SEEN
U1EE2F | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL AIN
U1EE30 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL FEH
U1EE31 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL SAD
U1EE32 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL QAF
U1EE34 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL SHEEN
U1EE35 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL TEH
U1EE36 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL THEH
U1EE37 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL KHAH
U1EE39 | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL DAD
U1EE3B | ***                        | A | ***                | ARABIC MATHEMATICAL INITIAL GHAIN
U1EE42 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED JEEM
U1EE47 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED HAH
U1EE49 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED YEH
U1EE4B | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED LAM
U1EE4D | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED NOON
U1EE4E | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED SEEN
U1EE4F | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED AIN
U1EE51 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED SAD
U1EE52 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED QAF
U1EE54 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED SHEEN
U1EE57 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED KHAH
U1EE59 | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED DAD
U1EE5B | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED GHAIN
U1EE5D | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED DOTLESS NOON
U1EE5F | ***                        | A | ***                | ARABIC MATHEMATICAL TAILED DOTLESS QAF
U1EE61 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED BEH
U1EE62 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED JEEM
U1EE64 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED HEH
U1EE67 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED HAH
U1EE68 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED TAH
U1EE69 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED YEH
U1EE6A | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED KAF
U1EE6C | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED MEEM
U1EE6D | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED NOON
U1EE6E | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED SEEN
U1EE6F | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED AIN
U1EE70 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED FEH
U1EE71 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED SAD
U1EE72 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED QAF
U1EE74 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED SHEEN
U1EE75 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED TEH
U1EE76 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED THEH
U1EE77 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED KHAH
U1EE79 | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED DAD
U1EE7A | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED ZAH
U1EE7B | ***                        | A | ***                | ARABIC MATHEMATICAL STRETCHED GHAIN
U1EE7C | ***                        | A | ***                | ARABIC MATHEMATICAL STRECHED DOTLESS BEH
U1EE7E | ***                        | A | ***                | ARABIC MATHEMATICAL STRECHED DOTLESS FEH
U1EE80 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED ALEF
U1EE81 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED BEH
U1EE82 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED JEEM
U1EE83 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED DAL
U1EE84 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED HEH
U1EE85 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED WAW
U1EE86 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED ZAIN
U1EE87 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED HAH
U1EE88 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED TAH
U1EE89 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED YEH
U1EE8B | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED LAM
U1EE8C | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED MEEM
U1EE8D | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED NOON
U1EE8E | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED SEEN
U1EE8F | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED AIN
U1EE90 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED FEH
U1EE91 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED SAD
U1EE92 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED QAF
U1EE93 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED REH
U1EE94 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED SHEEN
U1EE95 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED TEH
U1EE96 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED THEH
U1EE97 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED KHAH
U1EE98 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED THAL
U1EE99 | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED DAD
U1EE9A | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED ZAH
U1EE9B | ***                        | A | ***                | ARABIC MATHEMATICAL LOOPED GHAIN
U1EEA1 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK BEH
U1EEA2 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK JEEM
U1EEA3 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK DAL
U1EEA5 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK WAW
U1EEA6 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK ZAIN
U1EEA7 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK HAH
U1EEA8 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK TAH
U1EEA9 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK YEH
U1EEAB | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK LAM
U1EEAC | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK MEEM
U1EEAD | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK NOON
U1EEAE | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK SEEN
U1EEAF | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK AIN
U1EEB0 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK FEH
U1EEB1 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK SAD
U1EEB2 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK QAF
U1EEB3 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK REH
U1EEB4 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK SHEEN
U1EEB5 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK TEH
U1EEB6 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK THEH
U1EEB7 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK KHAH
U1EEB8 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK THAL
U1EEB9 | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK DAD
U1EEBA | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK ZAH
U1EEBB | ***                        | A | ***                | ARABIC MATHEMATICAL DOUBLE STRUCK GHAIN
U1EEF0 | \arabicmaj                 | A | \mathop            | ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WITH TATWEEL
U1EEF1 | \arabichad                 | A | \mathop            | ARABIC MATHEMATICAL OPERATOR HAH WITH DAL

Add info or delete as appropriate:

josephwright commented 10 months ago

Suggestion here is we tidy up and pre-load the corrected data in the format?

davidcarlisle commented 10 months ago

@josephwright well this issue is just about getting the lists aligned or at least differences documented.

We could then consider working similarly to lccode tables, initialising the format based on unicode data, then over-riding with tex specific corrections and historic choices in a separate phase, but that would be a new issue. Actually I thought pre-loading the table was an old issue, it's certainly been raised before, but a search of issues didn't show anything just now.

davidcarlisle commented 10 months ago

I updated the query to also log (marked !) characters with an entry in unicode-math-table but not in MathClass-15.

The list in the issue description is updated.

davidcarlisle commented 10 months ago

The entries marked ??? (where unicode.math asserts ther is a unicode-math definition that is not found are

U00332 | \underbar                  | D | ???                | COMBINING LOW LINE
U003D0 | \m?upvarbeta               | A | ???                | GREEK BETA SYMBOL
U003D8 | \m?upoldKoppa              | N | ???                | GREEK LETTER ARCHAIC KOPPA
U003D9 | \m?upoldkoppa              | N | ???                | GREEK SMALL LETTER ARCHAIC KOPPA
U003DA | \m?upStigma                | A | ???                | GREEK LETTER STIGMA
U003DB | \m?upstigma                | A | ???                | GREEK SMALL LETTER STIGMA
U003DE | \m?upKoppa                 | ! | ???                | GREEK LETTER KOPPA
U003DF | \m?upkoppa                 | ! | ???                | GREEK SMALL LETTER KOPPA
U003E0 | \m?upSampi                 | A | ???                | GREEK LETTER SAMPI
U003E1 | \m?upsampi                 | A | ???                | GREEK SMALL LETTER SAMPI

I think the first is just an error \underbar is obviously defined but I don't think it's associated with U+0332 in any real way and I should delete this entry in unicode.xml

The others are all old Greek. @wspr I must have thought these were defined in unicode-math at some point, has this changed or are they defined in some option?, I couldn't see them, if not I should probably just similarly correct unicode.xml and delete these entries?

wspr commented 10 months ago

I think they were defined once upon a time, but I came up absolutely blank when looking for examples of them anyway, so I think I assumed they were examples of glyphs from a font that may have been associated with maths but wasn’t actually used in maths typesetting.

If you have even hints that they should be included in UM I’m happy to bring them back.

(I suspect that quite a number of the geometrical symbols and graphical looking arrows are also in this category.)

davidcarlisle commented 10 months ago

@wspr thanks, I'll delete them for now...

done, list in the description updated.

wspr commented 10 months ago

I’m actually a bit torn on this; I like the idea of providing more symbols for people to extend their repertoire with. And these symbols are in STIX Two, so I’d suggest we bring them back in UM rather than keep them deleted… WDYT?

davidcarlisle commented 10 months ago

Fine by me, now the list in unicode-math-table matches what unicode.xml says is in unicode-math so if you add them here, I'll add them there and keep in sync...

davidcarlisle commented 10 months ago

It's probably worth reviewing the cases where there are different assignments (rather than missing definitions) especially if we might consider pre-initialising the mathcodes from the MathClass-xx file.

I have also omitted here assignments to Unicode combining characters which will necessarily require modified (or probably missing) support in tex.

delimiters


I plan to raise an issue at Unicode on these left and right delimiters that are marked as infix relations

U023B0 | \lmoustache                | R | \mathopen          | UPPER LEFT OR LOWER RIGHT CURLY BRACKET SECTION
U023B1 | \rmoustache                | R | \mathclose         | UPPER RIGHT OR LOWER LEFT CURLY BRACKET SECTION
U027C5 | \lbag                      | R | \mathopen          | LEFT S-SHAPED BAG DELIMITER
U027C6 | \rbag                      | R | \mathclose         | RIGHT S-SHAPED BAG DELIMITER

radicals

These are almost always accessed via (U)radical so the mathclass is perhaps not so important

U0221A | \sqrt                      | L | \mathopen          | SQUARE ROOT
U0221B | \cuberoot                  | L | \mathopen          | CUBE ROOT
U0221C | \fourthroot                | L | \mathopen          | FOURTH ROOT
U027CC | \longdivision              | L | \mathopen          | LONG DIVISION

Arabic operators

mathord or mathop?

U1EEF0 | \arabicmaj                 | A | \mathop            | ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WITH TATWEEL
U1EEF1 | \arabichad                 | A | \mathop            | ARABIC MATHEMATICAL OPERATOR HAH WITH DAL

Punctuation

It isn't clear if Unicode P really relates to \mathpunct which would add spacing in 1.5 or should a name such as \mathperiod relate to a sentence full stop rather than decimal point and be spaced that way even if the mathcode of . is mathord not mathpunct?

U00021 | \mathexclam                | N | \mathclose         | EXCLAMATION MARK
U0002E | \mathperiod                | P | \mathord           | FULL STOP
U0003F | \mathquestion              | P | \mathord           | QUESTION MARK

arrows etc

mathrel or mathord?

U021A8 | \updownarrowbar            | R | \mathord           | UP DOWN ARROW WITH BASE
U021BA | \acwopencirclearrow        | R | \mathord           | ANTICLOCKWISE OPEN CIRCLE ARROW
U021BB | \cwopencirclearrow         | R | \mathord           | CLOCKWISE OPEN CIRCLE ARROW
U021DE | \nHuparrow                 | R | \mathord           | UPWARDS ARROW WITH DOUBLE STROKE
U021DF | \nHdownarrow               | R | \mathord           | DOWNWARDS ARROW WITH DOUBLE STROKE
U021E0 | \leftdasharrow             | R | \mathord           | LEFTWARDS DASHED ARROW
U021E1 | \updasharrow               | R | \mathord           | UPWARDS DASHED ARROW
U021E2 | \rightdasharrow            | R | \mathord           | RIGHTWARDS DASHED ARROW
U021E3 | \downdasharrow             | R | \mathord           | DOWNWARDS DASHED ARROW
U021E6 | \leftwhitearrow            | R | \mathord           | LEFTWARDS WHITE ARROW
U021E7 | \upwhitearrow              | R | \mathord           | UPWARDS WHITE ARROW
U021E8 | \rightwhitearrow           | R | \mathord           | RIGHTWARDS WHITE ARROW
U021E9 | \downwhitearrow            | R | \mathord           | DOWNWARDS WHITE ARROW
U021EA | \whitearrowupfrombar       | R | \mathord           | UPWARDS WHITE ARROW FROM BAR
U02234 | \therefore                 | R | \mathord           | THEREFORE
U02235 | \because                   | R | \mathord           | BECAUSE
U022A5 | \bot                       | R | \mathord           | UP TACK
U022EF | \unicodecdots              | R | \mathord           | MIDLINE HORIZONTAL ELLIPSIS
U0237C | \rangledownzigzagarrow     | R | \mathord           | RIGHT ANGLE WITH DOWNWARDS ZIGZAG ARROW
U027C3 | \subsetcirc                | R | \mathord           | OPEN SUBSET
U027C4 | \supsetcirc                | R | \mathord           | OPEN SUPERSET
U027C7 | \veedot                    | R | \mathbin           | OR WITH DOT INSIDE
U027CB | \diagup                    | R | \mathord           | MATHEMATICAL RISING DIAGONAL
U027CD | \diagdown                  | R | \mathord           | MATHEMATICAL FALLING DIAGONAL
U0292B | \rdiagovfdiag              | R | \mathord           | RISING DIAGONAL CROSSING FALLING DIAGONAL
U0292C | \fdiagovrdiag              | R | \mathord           | FALLING DIAGONAL CROSSING RISING DIAGONAL
U0292D | \seovnearrow               | R | \mathord           | SOUTH EAST ARROW CROSSING NORTH EAST ARROW
U0292E | \neovsearrow               | R | \mathord           | NORTH EAST ARROW CROSSING SOUTH EAST ARROW
U0292F | \fdiagovnearrow            | R | \mathord           | FALLING DIAGONAL CROSSING NORTH EAST ARROW
U02930 | \rdiagovsearrow            | R | \mathord           | RISING DIAGONAL CROSSING SOUTH EAST ARROW
U02931 | \neovnwarrow               | R | \mathord           | NORTH EAST ARROW CROSSING NORTH WEST ARROW
U02932 | \nwovnearrow               | R | \mathord           | NORTH WEST ARROW CROSSING NORTH EAST ARROW
U02934 | \uprightcurvearrow         | R | \mathord           | ARROW POINTING RIGHTWARDS THEN CURVING UPWARDS
U02935 | \downrightcurvedarrow      | R | \mathord           | ARROW POINTING RIGHTWARDS THEN CURVING DOWNWARDS

triangles etc

mathbin or mathord ?

U0002F | \mathslash                 | B | \mathord           | SOLIDUS
U0005C | \backslash                 | B | \mathord           | REVERSE SOLIDUS
U022B9 | \hermitmatrix              | B | \mathord           | HERMITIAN CONJUGATE MATRIX
U025B2 | \bigblacktriangleup        | B | \mathord           | BLACK UP-POINTING TRIANGLE
U025B4 | \blacktriangle             | B | \mathord           | BLACK UP-POINTING SMALL TRIANGLE
U025B5 | \vartriangle               | B | \mathrel           | WHITE UP-POINTING SMALL TRIANGLE
U025B6 | \blacktriangleright        | B | \mathord           | BLACK RIGHT-POINTING TRIANGLE
U025B8 | \smallblacktriangleright   | B | \mathord           | BLACK RIGHT-POINTING SMALL TRIANGLE
U025B9 | \smalltriangleright        | B | \mathord           | WHITE RIGHT-POINTING SMALL TRIANGLE
U025BC | \bigblacktriangledown      | B | \mathord           | BLACK DOWN-POINTING TRIANGLE
U025BD | \bigtriangledown           | B | \mathord           | WHITE DOWN-POINTING TRIANGLE
U025BE | \blacktriangledown         | B | \mathord           | BLACK DOWN-POINTING SMALL TRIANGLE
U025BF | \triangledown              | B | \mathord           | WHITE DOWN-POINTING SMALL TRIANGLE
U025C0 | \blacktriangleleft         | B | \mathord           | BLACK LEFT-POINTING TRIANGLE
U025C2 | \smallblacktriangleleft    | B | \mathord           | BLACK LEFT-POINTING SMALL TRIANGLE
U025C3 | \smalltriangleleft         | B | \mathord           | WHITE LEFT-POINTING SMALL TRIANGLE
U025C4 | \blackpointerleft          | B | \mathord           | BLACK LEFT-POINTING POINTER
U025C5 | \whitepointerleft          | B | \mathord           | WHITE LEFT-POINTING POINTER
U025CA | \mdlgwhtlozenge            | B | \mathord           | LOZENGE
U025E6 | \smwhtcircle               | B | \mathord           | WHITE BULLET
U025EC | \trianglecdot              | B | \mathord           | WHITE UP-POINTING TRIANGLE WITH DOT
U025F8 | \ultriangle                | B | \mathord           | UPPER LEFT TRIANGLE
U025F9 | \urtriangle                | B | \mathord           | UPPER RIGHT TRIANGLE
U025FA | \lltriangle                | B | \mathord           | LOWER LEFT TRIANGLE
U025FB | \mdwhtsquare               | B | \mathord           | WHITE MEDIUM SQUARE
U025FC | \mdblksquare               | B | \mathord           | BLACK MEDIUM SQUARE
U025FD | \mdsmwhtsquare             | B | \mathord           | WHITE MEDIUM SMALL SQUARE
U025FE | \mdsmblksquare             | B | \mathord           | BLACK MEDIUM SMALL SQUARE
U025FF | \lrtriangle                | B | \mathord           | LOWER RIGHT TRIANGLE
U02605 | \bigstar                   | B | \mathord           | BLACK STAR
U02606 | \bigwhitestar              | B | \mathord           | WHITE STAR

daggers

bin or rel?

U02020 | \dagger                    | R | \mathbin           | DAGGER
U02021 | \ddagger                   | R | \mathbin           | DOUBLE DAGGER

the rest

U0210F | \hslash                    | N | \mathalpha         | PLANCK CONSTANT OVER TWO PI
U02129 | \m?turnediota              | N | \mathalpha         | TURNED GREEK SMALL LETTER IOTA
U0213C | \Bbbpi                     | A | \mathord           | DOUBLE-STRUCK SMALL PI
U0213E | \BbbGamma                  | N | \mathalpha         | DOUBLE-STRUCK CAPITAL GAMMA
U0214B | \upand                     | N | \mathbin           | TURNED AMPERSAND
U02202 | \partial                   | N | \mathalpha         | PARTIAL DIFFERENTIAL
U02207 | \nabla                     | U | \mathalpha         | NABLA
U023B4 | \overbracket               | N | \mathover          | TOP SQUARE BRACKET
U023B5 | \underbracket              | N | \mathunder         | BOTTOM SQUARE BRACKET
U023DC | \overparen                 | N | \mathover          | TOP PARENTHESIS
U023DD | \underparen                | N | \mathunder         | BOTTOM PARENTHESIS
U023DE | \overbrace                 | N | \mathover          | TOP CURLY BRACKET
U023DF | \underbrace                | N | \mathunder         | BOTTOM CURLY BRACKET
U029B5 | \circlehbar                | N | \mathbin           | CIRCLE WITH HORIZONTAL BAR
U029CD | \triangleserifs            | N | \mathbin           | TRIANGLE WITH SERIFS AT BOTTOM
khaledhosny commented 3 months ago

Arabic operators

mathord or mathop?

U1EEF0 | \arabicmaj                 | A | \mathop            | ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WITH TATWEEL
U1EEF1 | \arabichad                 | A | \mathop            | ARABIC MATHEMATICAL OPERATOR HAH WITH DAL

U1EEF0 (\arabicmaj) is an alternative to \sum so should have the same math class, which is \mathop.

U1EEF1 (\arabichad) is an alternative to \lim, so should behave the same. \lim is not a single symbol, but it is defined as \mathop{lim}, so I think \mathop is appropriate here.