Describe alternatives you've considered (Optional)
We could implement an S3 buffer similar to the Kafka buffer that does not require splitting the pipeline. But, creating this would be quite a bit faster.
Also, I think we should leave room for a possible S3 buffer that is implement. My proposal is to alter the name of this buffer to make it distinct from an S3 buffer. And also to avoid confusing with other buffers such as Kafka. Thus, I called this pipeline_s3.
One alternative to changing the name is to use a flag instead - split_pipeline: true or asynchronous_buffer: true.
Is your feature request related to a problem? Please describe.
For workloads that are smaller and want durability, using S3 as a buffer can be a good solution.
Describe the solution you'd like
Data Prepper already has a few things that we can combine to create an S3 buffer.
I propose that we have a new buffer -
pipeline_s3
which is implemented only as a pipeline transformation.This would transform into:
Describe alternatives you've considered (Optional)
We could implement an S3 buffer similar to the Kafka buffer that does not require splitting the pipeline. But, creating this would be quite a bit faster.
Also, I think we should leave room for a possible S3 buffer that is implement. My proposal is to alter the name of this buffer to make it distinct from an S3 buffer. And also to avoid confusing with other buffers such as Kafka. Thus, I called this
pipeline_s3
.One alternative to changing the name is to use a flag instead -
split_pipeline: true
orasynchronous_buffer: true
.Additional context
N/A