yob / pdf-reader

The PDF::Reader library implements a PDF parser conforming as much as possible to the PDF specification from Adobe.
MIT License

Getting mangled characters / mixed words #354

Open MJCune1 opened 3 years ago

MJCune1 commented 3 years ago

Hi,

I’m trying to extract text with the latest version of the gem (2.4.2), but I’m getting mangled characters and mixed words. It seems columns are getting mixed or joined.

PDF file:

[image: table]

Console result:

[2] pry(main)> reader.pages.first.text
=> "1         25.660.774-SALAS                                     BRUNO"
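To make the joined columns easier to see, the returned line can be split wherever two or more spaces occur. This is only a diagnostic sketch: `split_columns` is a hypothetical helper, not part of PDF::Reader, and the two-space threshold is an assumption about how column gaps appear in the extracted text.

```ruby
# The line returned by reader.pages.first.text in the console output above.
line = "1         25.660.774-SALAS                                     BRUNO"

# Hypothetical helper (not a PDF::Reader API): treat any run of two or
# more whitespace characters as a gap between columns.
def split_columns(text_line)
  text_line.strip.split(/\s{2,}/)
end

fields = split_columns(line)
p fields
# => ["1", "25.660.774-SALAS", "BRUNO"]
```

The second field shows the problem: the ID number and the surname come back as one token joined by a hyphen, with no gap between them.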

Thanks in advance