jwilk-archive / ocrodjvu

OCR for DjVu
GNU General Public License v2.0

djvused script without escaping Unicode characters #13

Closed jwilk closed 9 years ago

jwilk commented 9 years ago

Issue reported by @jsbien:

This has been possible for some time already, cf. http://sourceforge.net/p/djvu/discussion/103285/thread/844fb32c/.

jwilk commented 9 years ago

I will have to add the missing feature to python-djvulibre first.

jwilk commented 9 years ago

In the meantime, you can use this one-liner to unescape non-ASCII characters:

perl -pe 's/(\\\D)|\\(\d{3})/$1 or chr(oct($2))/ge'
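
For anyone who would rather do the same thing from Python, here is a rough equivalent of the one-liner above. It is only a sketch: the function name `unescape_octal` and the sample line are made up for illustration and are not part of ocrodjvu or python-djvulibre. Unlike the Perl version, it only converts escapes in the range `\000` to `\377`, so each escape always maps to a single byte.

```python
import re

# First alternative: a backslash followed by a non-digit (e.g. \" or \\),
# which is kept as-is so that an escaped backslash is not misread.
# Second alternative: a backslash followed by three octal digits (000-377).
_ESCAPE = re.compile(rb'(\\\D)|\\([0-3][0-7]{2})')


def unescape_octal(data: bytes) -> bytes:
    """Replace \\ooo octal escapes with the raw bytes they stand for."""
    def replace(match):
        if match.group(1) is not None:
            return match.group(1)  # leave other escapes untouched
        return bytes([int(match.group(2), 8)])
    return _ESCAPE.sub(replace, data)


# Hypothetical sample line in the style of djvused text output.
sample = rb'(word 0 0 10 10 "\305\274\303\263\305\202w")'
print(unescape_octal(sample).decode('utf-8'))
```

The work is done on bytes rather than text because each octal escape encodes one byte of a UTF-8 sequence; decoding only makes sense after all escapes have been replaced.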

jwilk commented 9 years ago

python-djvulibre 0.4 supports printing S-expressions with unescaped non-ASCII characters.

jwilk commented 9 years ago

Fixed in 7804c5fe09ff460173206bceb6b8b1fb1b54fc08.

jwilk commented 9 years ago

Fixed in ocrodjvu 0.9.