There are a number of reasons why we chose not to expose a batched API for the writers and readers. The main one is that, unlike NN compute, the performance of a system like Reverb is not limited by memory bandwidth, so batching frames in storage and during I/O is not always advantageous. In these cases, we prefer the Writers to handle one "environment" at a time and to perform batching/chunking of data inside Reverb's client and server. Similarly, when we read from the server we use custom control flow to minimize latency in the sampler and rely on `tf.data` to batch data from multiple sampler threads. As we improve the system, the control-flow algorithms improve and performance improves even if the API does not expose batching.
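To make that reading pattern concrete, here is a minimal sketch of interleaving several Reverb sampler streams with `tf.data` and batching across them. The server address, table name, batch size, and worker count are placeholders, and `from_table_signature` assumes the table was created with a signature; see the ACME dataset linked below for a complete implementation.

```python
import reverb
import tensorflow as tf

SERVER_ADDRESS = 'localhost:8000'  # placeholder
TABLE = 'my_table'                 # placeholder
BATCH_SIZE = 64
NUM_WORKERS = 4  # number of parallel sampler connections

def _make_one_sampler(_):
  # Each call opens an independent sampler stream to the server.
  return reverb.TrajectoryDataset.from_table_signature(
      server_address=SERVER_ADDRESS,
      table=TABLE,
      max_in_flight_samples_per_worker=2 * BATCH_SIZE)

# Interleave the sampler streams and batch elements across them.
dataset = tf.data.Dataset.range(NUM_WORKERS).interleave(
    map_func=_make_one_sampler,
    cycle_length=NUM_WORKERS,
    num_parallel_calls=NUM_WORKERS,
    deterministic=False)
dataset = dataset.batch(BATCH_SIZE, drop_remainder=True)
```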
The current suggestion is to have a separate writer for each environment, and to use `tf.data` with interleaving to sample in parallel and batch data (roughly the pattern sketched above). For the latter, the ACME reverb dataset implements our currently recommended best practices. An example of the new TrajectoryWriter is also available in ACME here; a minimal sketch of the per-environment writer pattern follows.
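On the writer side, a sketch of the one-writer-per-environment pattern might look like the following. The server address, table name, trajectory length, and the random arrays standing in for a `BatchedEnv` step are all assumptions; it also assumes a Reverb server with that table is already running.

```python
import numpy as np
import reverb

NUM_ENVS = 4        # n in the question
TRAJ_LEN = 2        # hypothetical trajectory length per item
TABLE = 'my_table'  # placeholder table name

# Assumes a Reverb server serving TABLE is running at this address.
client = reverb.Client('localhost:8000')

# One TrajectoryWriter per environment in the batch.
writers = [
    client.trajectory_writer(num_keep_alive_refs=TRAJ_LEN)
    for _ in range(NUM_ENVS)
]

for step in range(10):
  # Stand-in for BatchedEnv.step(): stacked arrays of shape [n, ...].
  observations = np.random.rand(NUM_ENVS, 84, 84).astype(np.float32)
  rewards = np.random.rand(NUM_ENVS).astype(np.float32)

  for i, writer in enumerate(writers):
    # Slice out this environment's row and append it to its own writer.
    writer.append({'observation': observations[i], 'reward': rewards[i]})
    if step + 1 >= TRAJ_LEN:
      # Insert the last TRAJ_LEN steps of this environment as one item.
      writer.create_item(
          table=TABLE,
          priority=1.0,
          trajectory={
              'observation': writer.history['observation'][-TRAJ_LEN:],
              'reward': writer.history['reward'][-TRAJ_LEN:],
          })

for writer in writers:
  writer.flush()
  writer.close()
```

With this layout the server sees `n` independent single-environment streams, and the `[T, batch_size, ...]` shape the learner needs is produced on the read side by the `tf.data` batching shown earlier.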
If you have particular performance issues, please let us know by filing a separate issue so that we can help you maximize your throughput / minimize latency for your case.
Hi,
From the tutorial, there are only examples of adding trajectories to the `reverb` server from a single environment. However, a more common setting in RL is to have a `BatchedEnv` holding multiple environments (say, `n`). In this case, on the actor side the shape of a tensor would be `[n, single_shape]`, but on the learner side we need a batch of samples of shape `[T, batch_size, single_shape]`. My question is: what is the best practice for this?