esoleyman / cldr-syriac

Track the CLDR Survey Tool for Syriac
2 stars 0 forks source link

Syriac Unicode CLDR Allowable Character Set #3

Open SargisYonan opened 3 years ago

SargisYonan commented 3 years ago

In an effort to come up with the most universal set of standardized words between Western and Assyrian readers and speakers, we should limit ourselves to characters in the Syriac character set that are shared between readers of both scripts for the purposes of Unicode CLDR:

  1. All standard characters in U+0710 SYRIAC LETTER ALAPH through U+072C SYRIAC LETTER TAW excluding the Garshuni characters [U+0727 SYRIAC LETTER REVERSED PE, U+071C SYRIAC LETTER TETH GARSHUNI, U+0714 SYRIAC LETTER GAMAL GARSHUNI]
  2. U+0308 COMBINING DIAERESIS - [ ̈ ] Syameh above (for pluralizations) - Note, Eastern does not use the below version. The above version is universal in this respect.
  3. U+0323 COMBINING DOT BELOW - [ ̣ ] for words like ܡ̣ܢ min
  4. U+0330 COMBINING TILDE BELOW - [ ̰ ] Majleana below
  5. U+0303 COMBINING TILDE -[ ̃ ] Majleana above
  6. U+0307 COMBINING DOT ABOVE - [ ̇ ] for feminizations like in ܗ̇ܝ as well as ܡ̇ܢ man
  7. U+0747 SYRIAC OBLIQUE LINE ABOVE - [ ݇ ] Talqana - Note, Eastern does not use the below version making this one the more universal one
  8. U+0742 SYRIAC RUKKAKHA - [ ݂ ] Note, this should be used over U+032E COMBINING BREVE BELOW under Peh since Western Assyrian does not use this character under Peh
  9. U+0741 SYRIAC QUSHSHAYA - [ ݁ ]
  10. U+070F SYRIAC ABBREVIATION MARK - e.g. ܬܫܒܘܚܬܐ --> ܬ܏ܫܒܘ -- also for numbers
  11. U+070A SYRIAC CONTRACTION - ܩܫ܊ <-- ܩܫܐ

Note that this list excludes all vowel diacritics from both scripts

Let me know what you think and if anything should be added/removed.

SargisYonan commented 3 years ago

I'd like to also add the following (although not common or used for CLDR)

U+060C ARABIC COMMA U+061B ARABIC SEMICOLON U+061F ARABIC QUESTION MARK U+0640 ARABIC TATWEEL

We should also come up with which arithmetic operators we'll use

esoleyman commented 3 years ago

For arithmetic operators, we should use the non-Arabic mathematical operators as does most of the world.

SargisYonan commented 3 years ago

That sounds good to me