documentcloud / docsplit

Break Apart Documents into Images, Text, Pages and PDFs
http://documentcloud.github.com/docsplit/
Other
833 stars 214 forks source link

Adding fail-through to Poppler in ImageExtractor to handle a failing PDF with Quartz annotations #126

Closed sergeyk closed 7 years ago

sergeyk commented 9 years ago

First added a failing test, then a fix for it. Note that the poppler (pdftocairo) fail-through doesn't respect the memory limit, but does respect resolution and output format.