kakaoenterprise / JORLDY

Repository for Open Source Reinforcement Learning Framework JORLDY
Apache License 2.0
359 stars 50 forks source link

R2D2 optimize and benchmark #212

Open kan-s0 opened 2 years ago

kan-s0 commented 2 years ago

Is your feature request related to a problem? Please describe. Currently, the state type stored as a transition in R2D2 is too large as float64. And if the sequence length is lengthened accordingly, the existing buffer size is too large.

Describe the solution you'd like

Describe alternatives you've considered

Additional context