ParadoxGameConverters / Linguistics

Linguistic configurations used by several converters
MIT License
1 stars 0 forks source link

File format #2

Open IohannesIohannium opened 2 years ago

IohannesIohannium commented 2 years ago

How the dictionary file should look like (example with table):

English en_article en_class en_adj French fr_article fr_class fr_adj German de_article de_class de_adj Spanish es_article es_class es_adj Russian ru_class ru_adj Polish pl_class pl_adj pl_adj_a Korean Chinese Czech cz_class cz_adj cz_adj_a Italian it_article it_class it_adj Hungarian hu_class Dutch nl_article nl_class nl_adj Portuguese pt_article pt_class pt_adj
England   en_s English Angleterre L' fr_fsav anglaise England   de_n englisch Inglaterra La es_fsa inglesa Англия ru_f английск Anglia pl_fs angielsk angielsc 잉글랜드 英国 Anglie cz_fs anglick angličt Inghilterra L' it_fsav inglese Anglia hu_s Engeland   nl_n Engels Inglaterra pt_fsn inglesa
United States The en_pl American États Unis Les fr_mpav américaine Vereinigte Staaten Die de_pl amerikanisch Estados Unidos Los es_mpa americana Соединённые Шта́ты ru_pl американск Stany Zjednoczone pl_mp amerykańsk amerykańsc 미국 美国 Spojené Státy cz_mp americk američt Stati Uniti Gli it_mpac americana Egyesült Államok hu_p Verenigde Staten Die nl_pl Amerikaans Estados Unidos Os pt_mpa americana
Chicago   en_s Chicagoan Chicago   fr_fsn chicagoise Chicago   de_n chicagoerisch Chicago   es_fsn chicagüense Чикаго ru_m чикагск Chicago pl_ns chicagowsk chicagowsc 시카고 芝加哥 Chicago cz_ns chicagsk chicagšk Chicago   it_fsn cicaghese Chicago hu_s Chicago   nl_n Chicagoer Chicago pt_fsn chicaguense

Explanation:

IohannesIohannium commented 2 years ago

Outputs for CK3

Outputs for EU4

Outputs for Vic2

Outputs for HoI4

IohannesIohannium commented 2 years ago

Special things:

How the dictionary file should work:

Matching is case-insensitive, diacritic-sensitive.

Output will be restricted to the languages a game actually has (we'll skip Vic2 Finnish column because Finnish is impossible to handle) and will not overwrite vanilla localisations (if a game already has RUS_ADJ as localized, the converter will NOT output RUS_ADJ)

IohannesIohannium commented 2 years ago

Grammar category will be used to define:

Every other grammar shenanigan such as adjective declension (français, française, ...), polity name's declension (Франция, Франции, ...) is to be done separately through the use of respective adj_rules.txt and decl_rules.txt files

IohannesIohannium commented 2 years ago

As an update, most columns that have articles separate have been removed, as articles can be inferred from categories anyway. Czech, Hungarian and Dutch have also been dropped.

IohannesIohannium commented 2 years ago

DROPPED: now resorting to simply giving the converter grammatical categories for each string (e.g. telling the converter that the French string Géorgie is in category fsac, that the Spanish string Puerto Rico is in category msn, etc.)