Backpropagation through time for PPO

Replicable-MARL / MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

https://marllib.readthedocs.io

MIT License

890 stars 142 forks source link

Backpropagation through time for PPO #213

Open fulacse opened 9 months ago

fulacse commented 9 months ago

PPO + LSTM have a extral hyperparameter what is bptt horizon. Is possible I set up it?

Theohhhu commented 8 months ago

Seems like in RLlib version 1.8.0, there is no native support for configuring the BPTT when using an RNN structure as the model base. Maybe consider trying the latest version of Ray for potential updates on this?