This issue is created to split functionality of a replay buffer into two traits, one for pushing samples from environments, the other for making batches to train agents.
In addition, this issue removes the trait bound of Batch: BatchBase in ReplayBufferBase. This modification allows replay buffers implementing this trait to emit batches that does not have the form (o, a, o', r, is_done). Instead, agents should specify trait bounds for the type of samples taken from replay buffers.
This issue is created to split functionality of a replay buffer into two traits, one for pushing samples from environments, the other for making batches to train agents.
In addition, this issue removes the trait bound of
Batch: BatchBase
inReplayBufferBase
. This modification allows replay buffers implementing this trait to emit batches that does not have the form(o, a, o', r, is_done)
. Instead, agents should specify trait bounds for the type of samples taken from replay buffers.