Eliminate possibility of OutOfMemory errors from scaling Kinesis Shards

snowplow / snowplow-s3-loader

Mirrors a Kinesis stream to Amazon S3 using the KCL

42 stars 38 forks source link

Currently the S3 Loader buffers operate on a per-shard basis. This means that if your rotation buffer is 64mb each shard allocated to the consumer can consume this much memory. If the number of shards suddenly scales you run the risk of needing not 64mb of memory for this buffer but instead N x MaxByteBuffer - let alone overhead for the JVM and processing in general being done.

This behavior makes it impossible to auto-scale consistently as you never know how much memory an individual consumer might end up needing.

snowplow / snowplow-s3-loader

Eliminate possibility of OutOfMemory errors from scaling Kinesis Shards #250