Mercury13 / unicodia

Encyclopedia of Unicode characters
https://mercury13.github.io/unicodia/
GNU General Public License v3.0
93 stars 4 forks source link
character-table encyclopedia unicode

What is Unicodia?

It is a simple Unicode encyclopedia and the most comprehensive character map ever. Right now Windows only.

Lifecycle phase: 5 (production/stable). Minor troubles with sustainability, but generally survived three releases of Unicode. And will survive the fourth if I survive.

I’m in Ukraine torn with war, so I’ll release often. See “war release” tag in Issues.

How to translate?

Language policy

Common. No war jargon. Describe 2022 war as neutral as possible. Every lingua franca (English, Russian, French) in its international form. Make examples as patriotic as possible for language we’re writing in: the same letter is Russian and Ukrainian in respective L10n’s. And English if the same phenomenon exists in English language. Apostrophe is U+2019.

English. International: truck > lorry, petrol > gas. Prefer British form if both are good. Punctuation around quotes is British/international: it’s inside quotes if it’s a part of “phrase being quoted”.

Russian. Ё is mandatory. No grammatical concessions to Ukrainian.

(May apply to new languages as well.) Adjectives like Georgian may agree to script (письменность, female in Russian), or to language (язык, male). The rules are…

Ukrainian. Avoid 2019 reform, earlier changes (e.g. frequent Ґ in loanwords) are embraced. Describe topics sensitive to nationalists as delicately as possible.

New languages.

How to build?

How to develop?

See develop.md.

Compatibility and policies

Platforms

Win7/10/11 x64 only. Rationale:

Tofu/misrenderings

Update Unicode

Wartime: as soon as base arrives, and release date is frozen, even on alpha review stage

Peacetime (probably): stable release + some big font covering a major set arrives. Han too if the coverage is really high

Emergency releases of a few characters (e.g. currency, Japanese era): instantly, even if they are tofu

Fonts

Fonts are always updated to release versions. Font is updated to alpha/beta if fixes a major misrender, and/or professionally implements a new character.

Naming: Noto if tables and existing glyphs are surely untouched; Uto otherwise.

These fonts are taken to Unicodia without author’s consent:

There’s no special policy for CJK Han ideographs, but mainland Chinese is preferred. All such fonts tend to recreate hypothetical mainland ideographs even if unseen in Chinese bases: SimSun does it for 25 of 100 ideographs from CJK A block. Anyway, Unicodia will never be a good ideograph guide, everything I write about ideographs I suck from other sources.

Data

Data is as neutral as possible. Examples.

Future functionality