SEMICeu / SDG-sandbox

The SDG Sandbox creates a space for the review of data models produced by WP4 - Data semantics, formats and quality - in the context of the preparatory work for the Single Digital Gateway Regulation.
14 stars 9 forks source link

Vehicle Registration: Cyrillic addresses #6

Closed roefie64 closed 3 years ago

roefie64 commented 4 years ago

Some countries in the EU use cyrillic script for names and addresses on their vehicle registration certificates and in their register. For the Vehicle Registration evidence it would be great to add extra fields to hold both the cyrillic as the latin data.

sethvanhooland commented 4 years ago

Thank you for your contribution Roelof, as it's also relevant for the work on other evidences. We'll get back in touch to discuss how we can address the issue you raised.

barthanssens commented 4 years ago

Would it be an option to allow for multiple occurrences of the attributes but with a different xml:lang, to specify multiple languages/scripts ?

And preferably to make it mandatory to (either on the root element or on the various elements) explicitly specify the language ? Using an explicitly defined language may help other tools and processes (e.g. machine translation)

(EU also has Greek script, by the way)

roefie64 commented 4 years ago

It might indeed be an option to have multiple occurances for languages but this is more about the script used then about the language. We need at least one in latin alphabet to be able to store and understand it in most of the countries in Europe. The origanal script can be used to check written documents and for addressing information letters.

makxdekkers commented 4 years ago

Please see the note on multilinguality with more details on how this could be done.

dechden commented 4 years ago

We would suggest to use following authority tables:

pfragkou commented 4 years ago

see also http://cldr.unicode.org/index/cldr-spec/transliteration-guidelines and https://ec.europa.eu/translation/greek/guidelines/el_guidelines_en.htm