norbusan / kobo-ja-dict-enhance

Enhance built-in Japanese dictionaries from Kobo with English definitions
GNU General Public License v3.0
427 stars 2 forks source link

"does not map to Unicode" error #5

Open jsteel44 opened 3 years ago

jsteel44 commented 3 years ago

Hi,

I wonder if you can advise what might be the issue here:

$ perl ./enhance-dictionary.pl --dict=edict2 --input=dicthtml-ja.zip
Using the following dictionaries as source for translations: edict2
loading edict2 type from edict2 ... euc-jp "\xE5" does not map to Unicode at ./enhance-dictionary.pl line 266.
euc-jp "\x88" does not map to Unicode at ./enhance-dictionary.pl line 266.
euc-jp "\x96" does not map to Unicode at ./enhance-dictionary.pl line 266.
euc-jp "\xE3" does not map to Unicode at ./enhance-dictionary.pl line 266.
[snip]
euc-jp "\x9D" does not map to Unicode at ./enhance-dictionary.pl line 292, <$wf> line 150045.
euc-jp "\xAE" does not map to Unicode at ./enhance-dictionary.pl line 292, <$wf> line 150045.
^C

I'm on Arch Linux with perl v5.32.1. I've left it running for a short while and it just repeats this error and I end up cancelling. No output files are created.

Many thanks

kvwoerden commented 2 years ago

Hi @jsteel44, I received the same error as you did when using the edict2 file included in the git repository.

Using instead the edict2 file from the edict home page worked for me: edict2.gz

After cloning the repository, I locally removed the edict2 file from the repository and replaced it with the unzipped file from the above link.