I have some tables inside a PDF that I really need to parse (the alternative is about 100 hours of monkey work).
That's impossible without passing the -layout option to the pdf parser.
This patch introduces the 'pdf_opts' param and works as expected: https://github.com/documentcloud/docsplit/pull/114
Just found this one too: https://github.com/documentcloud/docsplit/pull/132
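
For anyone stuck on the same thing until one of these gets merged, here is a rough sketch of what the -layout option amounts to when the extractor is called directly. It assumes the pdf parser in question is the pdftotext command-line tool and that it is installed and on PATH; the helper names `build_pdftotext_cmd` and `extract_text` are made up for illustration and are not part of docsplit or of either patch.

```python
import subprocess


def build_pdftotext_cmd(pdf_path, txt_path, layout=True):
    """Build the argument list for pdftotext (hypothetical helper, not docsplit API)."""
    cmd = ["pdftotext"]
    if layout:
        # -layout asks pdftotext to keep the physical page layout,
        # which is what keeps table columns lined up in the output.
        cmd.append("-layout")
    cmd += [pdf_path, txt_path]
    return cmd


def extract_text(pdf_path, txt_path, layout=True):
    """Run pdftotext; assumes the binary is installed and on PATH."""
    # check=True raises CalledProcessError if pdftotext exits non-zero.
    subprocess.run(build_pdftotext_cmd(pdf_path, txt_path, layout), check=True)
```

Calling `extract_text("report.pdf", "report.txt")` (file names are placeholders) should write the text with columns preserved, which can then be split on runs of whitespace to recover table cells.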