aws / aws-sdk

Landing page for the AWS SDKs on GitHub
https://aws.amazon.com/tools/
Other
68 stars 13 forks source link

Faster Paging #570

Open djchapm opened 11 months ago

djchapm commented 11 months ago

Describe the feature

Provide pagination metadata so that we may perform retrievals in parallel. i.e. async listObjectsV2, we never know how many pages there are and are unable to pull a page until we know the page before it and a dynamic 'next-token'.

Increase maxKeys limit from 1000 to allow something closer to maybe 10K or more.

Use Case

highly complex system writing streaming data to S3 - on disaster recovery we need to scan S3 objects, identify key patterns and determine where we left off in order to restart at the correct location. Across something like 10Million keys.

Today max request is 1000 keys at a time, and no way to do this in parallel.

Proposed Solution

Or suggest strategies - thoughts:

Other Information

No response

Acknowledgements

AWS Java SDK version used

2.20.79

JDK version used

17

Operating System and version

AMD64/ARM64 architectures

debora-ito commented 11 months ago

@djchapm These are all service-side changes.

I will send an internal request to the S3 team with your feedback. Transferring this to the aws-sdk repo for cross-SDK visibility.

P96288338