jrmuizel / pdf-extract

A rust library for extracting content from pdfs

unexpected smask type 554 0 R #27

Open grindfuzz opened 3 years ago

grindfuzz commented 3 years ago

Hi, first of all, thank you for a very useful crate! :)

I tried to convert a PDF file ( https://arxiv.org/pdf/2108.11950v1.pdf ) to text and got the following panic:

thread 'main' panicked at 'unexpected smask type 554 0 R', /home/user/.cargo/git/checkouts/pdf-extract-1e3ad5dc34c14d18/e03d663/src/lib.rs:1190:24

The problem seems to be somewhere on page 7 (I get text output for pages 1-6).

grindfuzz commented 3 years ago

This could probably be solved by #25 (applying the change from that PR resolves the issue).
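
For anyone hitting the same panic: `554 0 R` is PDF syntax for an indirect reference, and the `SMask` entry of a graphics state dictionary is normally either the name `None` or a soft-mask dictionary. The message therefore suggests the value was matched before the reference was followed. Below is a minimal sketch of dereferencing first. The `Object`, `SoftMask`, `ObjectTable`, `resolve` and `soft_mask` names are hypothetical and are not the pdf-extract or lopdf API, and this is not necessarily what #25 does.

```rust
use std::collections::HashMap;

/// Hypothetical, simplified PDF object model (NOT lopdf's real types).
#[derive(Debug, Clone, PartialEq)]
enum Object {
    Name(String),
    Dictionary(HashMap<String, Object>),
    /// Indirect reference (object number, generation), e.g. `554 0 R`.
    Reference(u32, u16),
}

/// Hypothetical result of interpreting an `SMask` entry.
#[derive(Debug, PartialEq)]
enum SoftMask {
    None,
    Dict(HashMap<String, Object>),
}

/// Hypothetical stand-in for the document's object table.
type ObjectTable = HashMap<(u32, u16), Object>;

/// Follow indirect references until a direct object is reached.
/// The hop limit guards against reference cycles in malformed files.
fn resolve<'a>(table: &'a ObjectTable, mut obj: &'a Object) -> Option<&'a Object> {
    for _ in 0..32 {
        match obj {
            Object::Reference(number, generation) => {
                obj = table.get(&(*number, *generation))?;
            }
            direct => return Some(direct),
        }
    }
    None
}

/// Interpret an `SMask` value, dereferencing it before matching on its type.
/// Returns an error instead of panicking on anything unexpected.
fn soft_mask(table: &ObjectTable, raw: &Object) -> Result<SoftMask, String> {
    match resolve(table, raw) {
        Some(Object::Name(name)) if name.as_str() == "None" => Ok(SoftMask::None),
        Some(Object::Dictionary(dict)) => Ok(SoftMask::Dict(dict.clone())),
        other => Err(format!("unexpected smask type {:?}", other)),
    }
}
```

In this sketch, a value like `Object::Reference(554, 0)` is looked up in the table and then handled like a direct dictionary, while a dangling reference yields an `Err` rather than a panic.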