Closed tty666 closed 3 weeks ago
But maybe it's a good addition and sometimes a replacement for LSTM/RNN?
That would be more for SB3 Contrib, I guess. And without any benchmark, it's hard to say whether it's a good addition. For instance, for recurrent PPO: https://wandb.ai/sb3/no-vel-envs/reports/PPO-vs-RecurrentPPO-aka-PPO-LSTM-on-environments-with-masked-velocity--VmlldzoxOTI4NjE4
(The gain over frame stacking is marginal on several envs, but it is substantial on others, such as Lunar Lander without velocity.)
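For readers unfamiliar with the frame-stacking baseline mentioned above, here is a minimal sketch of the idea in plain Python. It is not SB3's `VecFrameStack` implementation: the helper names `stack_frames` and `estimate_velocity`, the choice to pad the first step by repeating the first observation, and the toy position data are all assumptions made for illustration. The point is only that when velocity is masked, keeping the last few observations lets a feed-forward policy recover it from consecutive positions.

```python
from collections import deque


def stack_frames(observations, n_stack=2):
    """Yield a tuple holding the last `n_stack` observations at each step.

    Hypothetical helper: the first step is padded by repeating the first
    observation (an assumption of this sketch, other schemes pad with zeros).
    """
    frames = deque(maxlen=n_stack)
    for obs in observations:
        if not frames:
            # Fill the buffer with copies of the first observation.
            frames.extend([obs] * n_stack)
        else:
            # The deque drops the oldest frame automatically.
            frames.append(obs)
        yield tuple(frames)


def estimate_velocity(stacked, dt=1.0):
    """Finite-difference velocity from the two most recent stacked positions."""
    return (stacked[-1] - stacked[-2]) / dt


# Toy 1D positions with the velocity channel masked out.
positions = [0.0, 1.0, 3.0, 6.0]
stacked = list(stack_frames(positions, n_stack=2))
velocities = [estimate_velocity(s) for s in stacked]
```

A recurrent policy has to learn this kind of memory itself, which may be why the gain over stacking is small on some envs and larger on others.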
Closing as no benchmark was provided to support the feature request. Feel free to re-open if you manage to get some quantitative results.
🚀 Feature
Hello guys, after watching this video: https://www.youtube.com/watch?v=WoLlZLdoEQk, I had the idea to extend NatureCNN to NatureCTN1D this way:
It's a quick addition, and I am pretty sure I could work on it more. But maybe it's a good addition and sometimes a replacement for LSTM/RNN? I am using SiLU in my context, but more generally ReLU could be used as the activation function. What do you think? Should I propose it as a pull request for the contrib repo? Or does it not make sense to you?
Motivation
RecurrentPPO didn't fit my needs, so I did some research into other possibilities.
Pitch
No response
Alternatives
No response
Additional context
No response
Checklist