tmeinlschmidt / csob-pdf-to-json

CSOB PDF statement to JSON

Error when processing TXT #1

Closed michalblaha closed 7 years ago

michalblaha commented 7 years ago

Tested on these PDFs: csob1.pdf, csob.pdf

I ran pdf2text and verified that the resulting text is UTF-8. It generated these TXT files: csob.txt, csob1.txt

Running csob2json.rb on Windows crashes:

    Reading csob.txt
    csob2json.rb:111:in `block in <main>': incompatible encoding regexp match (UTF-8 regexp with CP852 string) (Encoding::CompatibilityError)
        from csob2json.rb:110:in `each'
        from csob2json.rb:110:in `each_with_index'
        from csob2json.rb:110:in `<main>'

On a Mac it runs fine. Both the .txt and the .rb files are UTF-8.
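For anyone trying to reproduce this without a Windows machine, here is a minimal sketch of what I believe is going on. It assumes Ruby tagged the lines it read as CP852 (the usual Czech console code page) because no encoding was given when the file was opened. The sample text and the pattern are made up; they are not taken from csob2json.rb.

```ruby
# Hypothetical sample line; "\u016F" is the Czech letter u with a ring.
line = "Z\u016Fstatek 100,00"

# Simulate a read under a CP852 default external encoding:
# the bytes stay the same, only the encoding label changes.
mislabeled = line.dup.force_encoding("CP852")

# A regexp containing a non-ASCII character is fixed to UTF-8.
pattern = /Z\u016Fstatek/

# Returns the exception raised by the match, or nil if none was raised.
def match_error(str, pattern)
  str =~ pattern
  nil
rescue Encoding::CompatibilityError => e
  e
end

error = match_error(mislabeled, pattern)
puts error.class

# Relabeling the same bytes as UTF-8 lets the match go through.
relabeled = mislabeled.dup.force_encoding("UTF-8")
puts relabeled =~ pattern ? "match" : "no match"
```

Lines that contain only ASCII characters would still match, so the crash would show up only on the first line with Czech diacritics.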

michalblaha commented 7 years ago

If it is run with the -EUTF-8 parameter, it completes OK.
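A possible way to avoid depending on the -E switch would be to state the encoding where the file is opened. This is only a sketch: `read_utf8_lines` is a hypothetical helper, since the actual reading code in csob2json.rb is not shown here, and the temporary file stands in for csob.txt.

```ruby
require "tempfile"

# Hypothetical helper: read a file as UTF-8 no matter what
# Encoding.default_external happens to be on the platform.
def read_utf8_lines(path)
  File.open(path, "r:UTF-8") { |f| f.each_line.map(&:chomp) }
end

# Stand-in for csob.txt: a temporary file holding UTF-8 bytes.
sample = Tempfile.new(["csob", ".txt"])
sample.close
File.binwrite(sample.path, "Z\u016Fstatek 100,00\n")

lines = read_utf8_lines(sample.path)
puts lines.first.encoding
puts lines.first =~ /Z\u016Fstatek/ ? "match" : "no match"

sample.unlink
```

Setting `Encoding.default_external = Encoding::UTF_8` at the top of the script should have a similar effect to -EUTF-8, but the per-file form keeps the assumption next to the read it applies to.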