laboroai / border

A reinforcement learning library in Rust
Apache License 2.0

[v0.0.6] Split functionality of ReplayBufferBase into two traits #54

Closed taku-y closed 1 year ago

taku-y commented 2 years ago

This issue splits the functionality of a replay buffer into two traits: one for pushing samples from environments, and one for making batches to train agents.

In addition, this issue removes the trait bound Batch: BatchBase from ReplayBufferBase. This change allows replay buffers implementing the trait to emit batches that do not have the form (o, a, o', r, is_done). Instead, agents should specify trait bounds on the type of samples taken from replay buffers.
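
The split described above could look roughly like the following sketch. It is not the library's actual API: the names PushSamples, MakeBatch, TransitionBatch, FifoBuffer, SimpleBatch and mean_reward are all hypothetical and chosen only for illustration. Returning the most recent samples instead of sampling at random is also a simplification, made to keep the example dependency-free. The point is that the batch-making trait puts no bound on its Batch type, and the agent-side function states the bound it needs.

```rust
use std::collections::VecDeque;

/// Hypothetical environment-facing trait: accepts samples.
pub trait PushSamples {
    type Item;
    fn push(&mut self, item: Self::Item);
    fn len(&self) -> usize;
}

/// Hypothetical agent-facing trait: emits batches.
/// Note that `Batch` carries no trait bound here.
pub trait MakeBatch {
    type Batch;
    fn batch(&mut self, size: usize) -> Option<Self::Batch>;
}

/// Hypothetical bound that one particular agent requires of its batches.
pub trait TransitionBatch {
    fn rewards(&self) -> &[f32];
}

/// A toy transition (o, a, o', r, is_done) with scalar observations.
#[derive(Clone, Debug)]
pub struct Transition {
    pub obs: f32,
    pub act: i32,
    pub next_obs: f32,
    pub reward: f32,
    pub is_done: bool,
}

/// A toy batch that keeps only the rewards.
pub struct SimpleBatch {
    rewards: Vec<f32>,
}

impl TransitionBatch for SimpleBatch {
    fn rewards(&self) -> &[f32] {
        &self.rewards
    }
}

/// Fixed-capacity FIFO buffer implementing both traits.
pub struct FifoBuffer {
    capacity: usize,
    items: VecDeque<Transition>,
}

impl FifoBuffer {
    pub fn new(capacity: usize) -> Self {
        assert!(capacity > 0, "capacity must be positive");
        Self { capacity, items: VecDeque::with_capacity(capacity) }
    }
}

impl PushSamples for FifoBuffer {
    type Item = Transition;

    fn push(&mut self, item: Transition) {
        // Drop the oldest sample once the buffer is full.
        if self.items.len() == self.capacity {
            self.items.pop_front();
        }
        self.items.push_back(item);
    }

    fn len(&self) -> usize {
        self.items.len()
    }
}

impl MakeBatch for FifoBuffer {
    type Batch = SimpleBatch;

    /// Returns the `size` most recent samples (no random sampling in this sketch).
    fn batch(&mut self, size: usize) -> Option<SimpleBatch> {
        if size == 0 || size > self.items.len() {
            return None;
        }
        let start = self.items.len() - size;
        let rewards = self.items.iter().skip(start).map(|t| t.reward).collect();
        Some(SimpleBatch { rewards })
    }
}

/// Agent-side code: the bound on the batch type is declared here,
/// not in the buffer trait.
pub fn mean_reward<R>(buffer: &mut R, size: usize) -> Option<f32>
where
    R: MakeBatch,
    R::Batch: TransitionBatch,
{
    let batch = buffer.batch(size)?;
    let r = batch.rewards();
    Some(r.iter().sum::<f32>() / r.len() as f32)
}
```

With this arrangement, an environment loop only needs a PushSamples bound and an agent only needs MakeBatch plus whatever bound it places on the batch type. A buffer that emits a differently shaped batch can still implement MakeBatch, and is simply rejected at compile time by agents whose bounds it does not satisfy.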