unidoc / unipdf

Golang PDF library for creating and processing PDF files (pure go)
https://unidoc.io
Other
2.5k stars 249 forks source link

[FEATURE] Concurrent page extraction #361

Open gunnsth opened 4 years ago

gunnsth commented 4 years ago

Is your feature request related to a problem? Please describe. Currently extraction only supports processing pages one by one. It might be more efficient to use multiple go-routines to handle page-by-page.

Describe the solution you'd like Explore what the easiest way to support concurrency in extractor package is.

Describe alternatives you've considered Alternative and currently the best way for concurrency is on a document basis. I.e. one go-routine handling a single document.

Additional context Client's comment

We often deal with documents that are 900+ pages and serially processing these with Unidoc was. Taking a long time and this a lot of money in AWS expenses.