herveus / Morsulus-tools

Morsulus's tool chain and related bits
http://www.morsulus.org

Add support for non-Romanized alternate forms of names #28

Open herveus opened 1 month ago

herveus commented 1 month ago

Add support for capturing the non-Romanized form of names. The SQLite database appears to impose no constraints on this. Many implementation details remain to be sorted out in various places.
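A quick way to check the SQLite point is a round trip through an in-memory database. This is only a sketch: the table and column names are hypothetical (the real schema is not shown in this issue), and the sample name is illustrative, not taken from the database.

```python
import sqlite3


def round_trip(name: str) -> str:
    """Store a name in an in-memory SQLite table and read it back."""
    conn = sqlite3.connect(":memory:")
    try:
        # Hypothetical table; the actual Morsulus schema may differ.
        conn.execute("CREATE TABLE names (name TEXT)")
        conn.execute("INSERT INTO names (name) VALUES (?)", (name,))
        (stored,) = conn.execute("SELECT name FROM names").fetchone()
        return stored
    finally:
        conn.close()


if __name__ == "__main__":
    sample = "Ольга"  # illustrative Cyrillic name
    print(round_trip(sample) == sample)
```

If the comparison holds for the scripts of interest, the database layer is unlikely to be the blocker, which matches the observation above.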

herveus commented 1 month ago

Presuming that a "native form" will appear in a tag in the XML, the XML extractor will catch that and create a note "(Native form: )".
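A rough sketch of that extractor step, in Python rather than the tool chain's own language. The tag name `native_form` is purely an assumption (the issue does not name the tag), and placing the tag's text inside the note is likewise a guess at the intended format.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical tag name; the issue does not say which tag carries the native form.
NATIVE_TAG = "native_form"


def native_form_notes(xml_text: str, tag: str = NATIVE_TAG) -> List[str]:
    """Return one "(Native form: ...)" note per non-empty occurrence of `tag`."""
    root = ET.fromstring(xml_text)
    notes = []
    for elem in root.iter(tag):
        text = (elem.text or "").strip()
        if text:  # skip empty tags rather than emit an empty note
            notes.append(f"(Native form: {text})")
    return notes


if __name__ == "__main__":
    # Illustrative input only; the real XML layout is not shown in this issue.
    sample = "<item><name>Olga</name><native_form>Ольга</native_form></item>"
    print(native_form_notes(sample))
```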

The long pole is getting UTF-8 out of OSCAR and through the not-JADED process that does not yet exist.

Changing the encoding for oanda.sca.org to UTF-8 overall will be necessary. The various pages and scripts probably need header updates.
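For the header side, the change is likely of this shape: declare the charset in `Content-Type` and emit the body as UTF-8 bytes. A minimal CGI-style sketch, assuming the scripts write their own headers; the function name is hypothetical and nothing here reflects how the oanda.sca.org scripts are actually written.

```python
def utf8_response(body: str, content_type: str = "text/html") -> bytes:
    """Build a CGI-style response that declares UTF-8 and encodes the body to match."""
    # The header itself is plain ASCII; a blank line separates it from the body.
    header = f"Content-Type: {content_type}; charset=utf-8\r\n\r\n"
    return header.encode("ascii") + body.encode("utf-8")


if __name__ == "__main__":
    import sys

    page = "<p>(Native form: Ольга)</p>"  # illustrative content
    sys.stdout.buffer.write(utf8_response(page))
```

The declared charset and the actual byte encoding have to agree. Static pages would need the equivalent declaration in the server configuration or in a `<meta charset="utf-8">` tag.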