Considerit / ConsiderIt

For deliberation and opinion visualization
GNU Affero General Public License v3.0

Proper encoding and data export for Central European languages that use diacritics #182

tkriplean closed 1 year ago

tkriplean commented 1 year ago

Reported by email:

In data exports, the text appears to be missing the diacritics/accents specific to Slovak (and to Czech, Polish, Hungarian, that is, Central European languages). As a layperson, I know that text editors and some other programs let the user choose the character encoding, for example the Windows Central European group. Maybe this could be changed through an optional setting such as ANSI or UTF-8, or some other encoding that accounts for languages other than English.

If it helps you see what I mean, I have attached a screenshot of a few lines from the opinions file. As an example, with the accents/diacritics in place, line 52 of that file should read as follows:

image

Could you please adjust the exported data, or add functionality, so that its text matches the language (localisation) of the forum as set up by administrators? I think admins from, e.g., Hungary, Poland, and other countries whose languages contain specific characters/diacritics/accents would find it useful, too.

tkriplean commented 1 year ago

Turns out this was a problem with how Excel reads accents and diacritics.
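
For anyone hitting the same symptom with their own CSV files, here is a minimal Python sketch of one common workaround. It assumes the export is UTF-8 text that Excel opens with a legacy code page because the file has no byte order mark (BOM); the thread does not confirm that this is the exact cause here, nor that ConsiderIt changed anything. The function name `write_csv_for_excel` and the sample rows are hypothetical and are not taken from the ConsiderIt codebase.

```python
import csv
import io


def write_csv_for_excel(rows, encoding="utf-8-sig"):
    """Serialize rows to CSV bytes.

    The "utf-8-sig" codec prepends the UTF-8 byte order mark (EF BB BF),
    which Excel generally uses as a hint that the file is UTF-8.
    Hypothetical helper for illustration only.
    """
    buffer = io.StringIO(newline="")
    csv.writer(buffer).writerows(rows)
    return buffer.getvalue().encode(encoding)


# Hypothetical sample data containing Slovak diacritics.
rows = [["user", "opinion"], ["Ján", "Súhlasím, ďakujem"]]
data = write_csv_for_excel(rows)

# Reading the bytes back with the same codec strips the BOM again.
text = data.decode("utf-8-sig")
```

If the file cannot be regenerated, importing it through Excel's text/CSV import and choosing UTF-8 as the file origin is another route, though the exact menu path depends on the Excel version.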