Add a client-side param called splitPdfPageRange which takes a list of two integers, [start, end]. If splitPdfPage is true and a range is set, slice the doc from start up to and including end. Only this page range will be sent to the API. The subset of pages is still split up as needed. If [start, end] is out of bounds, throw an error to the user.
Testing
Check out this branch and set up a request to your local API:
Test out various page ranges and confirm that the returned elements are within the range. Invalid ranges should throw a useful Error (pages are out of bounds, or end_page < start_page).
To match the python feature: https://github.com/Unstructured-IO/unstructured-python-client/pull/125
New parameter
Add a client-side param called
splitPdfPageRange
which takes a list of two integers,[start, end]
. IfsplitPdfPage
istrue
and a range is set, slice the doc fromstart
up to and includingend
. Only this page range will be sent to the API. The subset of pages is still split up as needed. If[start, end]
is out of bounds, throw an error to the user.Testing
Check out this branch and set up a request to your local API:
Test out various page ranges and confirm that the returned elements are within the range. Invalid ranges should throw a useful Error (pages are out of bounds, or end_page < start_page).