aws-samples / amazon-textract-idp-cdk-constructs

MIT No Attribution
27 stars 13 forks source link

DocumentSplitter to support different sizes #73

Closed schadem closed 1 year ago

schadem commented 1 year ago

DocumentSplitter only splits into 1 page documents right now. Would be awesome if that would configurable to support splitting documents that are too large for Textract to process into smaller chunks. For example a 5000 page PDF document into one 3000 page and one with 2000 pages.