schierlm / BibleMultiConverter

Converter written in Java to convert between different Bible program formats
Other
124 stars 33 forks source link

Charset lost from SWORD to YCHPalmBible #36

Closed OSDVF closed 3 years ago

OSDVF commented 3 years ago

I'm trying to convert CzeCEP module from SWORD to YCHPalmBible and some chars as 'ů' or 'č' are being lost even that there is a ENCODE="Cp1252" attribute in the PARSERINFO tag. I tried to converting whole SWORD module (which is in UTF-8) to windows-1252 with no success. Any soultions?

schierlm commented 3 years ago

Thank you for your bug report.

The characters get lost as the encoding used by my exporter is cp1252 which does not include those characters.

All Palm models I ever owned were European EFIGS models (supporting English French Italian German Spanish) and they did not support any other character set than cp1252 (which also means you can never see an ů or č on them) so I could never implement any other charsets.

I guess that Czech PalmOS might use cp1250 encoding, but I am not sure.

If you can provide insight about PalmOS models that support Czech language (or could even dump a ROM of one of them that could be used in Palm OS Emulator), I might implement other charsets.

You may also try changing the charset name in these two lines, but no guarantee it will work on your Palm device.

OSDVF commented 3 years ago

I'll try changing the two lines and we'll se... Thank you

schierlm commented 3 years ago

Did it solve the issue for you? If not, the ROM dumper app is still available at https://palmdb.net/app/rom-dumper, so you could dump your ROM and I'll try in the emulator what charset I need to use.

OSDVF commented 3 years ago

Yeah, it works now. (Btw. I use BibleConverter additionaly to convert to binary PDB format)

schierlm commented 3 years ago

Closing issue :-)