pdf2text spusteno, overil jsem, ze txt je v UTF-8.
vygenerovany tyto TXT:
csob.txtcsob1.txt
Spusteni csob2json.rb na windows spadne.
Reading csob.txt
csob2json.rb:111:in block in <main>': incompatible encoding regexp match (UTF-8 regexp with CP852 string) (Encoding::CompatibilityError) from csob2json.rb:110:ineach'
from csob2json.rb:110:in each_with_index' from csob2json.rb:110:in'
Testovano na techto PDF: csob1.pdf csob.pdf
pdf2text spusteno, overil jsem, ze txt je v UTF-8. vygenerovany tyto TXT: csob.txt csob1.txt
Spusteni csob2json.rb na windows spadne. Reading csob.txt csob2json.rb:111:in'
block in <main>': incompatible encoding regexp match (UTF-8 regexp with CP852 string) (Encoding::CompatibilityError) from csob2json.rb:110:in
each' from csob2json.rb:110:ineach_with_index' from csob2json.rb:110:in
na Mac to probehne ok. .txt i .rb jsou v UTF-8.