UCREL / Multilingual-USAS

Lexicons for the Multilingual UCREL Semantic Analysis System

Language folder re-naming / restructuring from language names to language codes #6

Open apmoore1 opened 2 years ago

apmoore1 commented 2 years ago

Rename each language folder from its language name to its BCP 47 language code. For example, the folder named French would become fr, since fr is the BCP 47 language code for French, as can be found through this BCP 47 code lookup tool.
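A minimal sketch of how such a rename could be scripted, assuming the language folders sit directly under one root directory. The `NAME_TO_CODE` table and both function names are hypothetical: only the French to fr pair comes from this issue, and the other entries are there purely for illustration.

```python
from __future__ import annotations

from pathlib import Path

# Hypothetical mapping from current folder names to BCP 47 codes.
# Only "French" -> "fr" is taken from the issue; the rest are illustrative.
NAME_TO_CODE = {
    "French": "fr",
    "Spanish": "es",
    "Welsh": "cy",
}


def plan_renames(root: Path, mapping: dict[str, str]) -> list[tuple[Path, Path]]:
    """Return (old, new) path pairs for folders under root with a known code."""
    plan = []
    for old in sorted(p for p in root.iterdir() if p.is_dir()):
        code = mapping.get(old.name)
        if code is None:
            continue  # no code known for this folder: leave it untouched
        new = root / code
        if new.exists():
            raise FileExistsError(f"target folder already exists: {new}")
        plan.append((old, new))
    return plan


def apply_renames(plan: list[tuple[Path, Path]]) -> None:
    """Perform the renames produced by plan_renames."""
    for old, new in plan:
        old.rename(new)
```

Building the plan separately from applying it lets the list of renames be reviewed first. Inside a git checkout, running `git mv` for each pair instead of `Path.rename` would be the more usual way to stage the renames.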

The main idea is that BCP 47 language codes are standardised, whereas language names are not: a language's name differs depending on the language it is written in, but its BCP 47 code is the same everywhere. BCP 47 codes also cover a wide range of languages and dialects, and they include the script in the code when it is not the default script for that language. For example, Latn is the default script for French, so it is not included in the code; if a French lexicon were written in a different script such as Hani, the code would reflect this: fr-Hani.
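The script rule can be sketched as a small helper that omits the script subtag when it is the language's default. The `DEFAULT_SCRIPT` table and the `make_tag` function are hypothetical names for this illustration, and the table holds only a few entries; the authoritative source for default scripts is the Suppress-Script field in the IANA Language Subtag Registry.

```python
from typing import Optional

# Illustrative subset of default ("suppressed") scripts per language.
# Assumption: a real tool would load these from the IANA Language Subtag Registry.
DEFAULT_SCRIPT = {
    "fr": "Latn",
    "en": "Latn",
    "ru": "Cyrl",
}


def make_tag(language: str, script: Optional[str] = None) -> str:
    """Build a language[-Script] tag, dropping the script when it is the default."""
    language = language.lower()  # language subtags are conventionally lowercase
    if script is None:
        return language
    script = script.title()  # script subtags are conventionally title case
    if DEFAULT_SCRIPT.get(language) == script:
        return language  # default script: leave it out of the tag
    return f"{language}-{script}"


print(make_tag("fr", "Latn"))  # fr
print(make_tag("fr", "Hani"))  # fr-Hani
```

This sketch only handles the language and script subtags; region, variant, and extension subtags are out of scope here.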

apmoore1 commented 2 years ago

An example of a repository that names its language folders by language code is the Universal Dependencies documentation repository.