Closed DJT777 closed 1 month ago
@araffin If the question is duplicate can you point me in the direction to the discussion about implementation of LSTM into SAC?
Related to https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/201 and https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/222 (and maybe other I forgot)
Basically just wondering if any use of an LSTM at all in a custom policy network would be supported or not.
No lstm at all is supported currently, you would need to fork SB3 too (see related issues to have a starting point).
@araffin awesome, thank you!
❓ Question
Hello!
I was reading the docs here https://stable-baselines3.readthedocs.io/en/master/modules/sac.html#notes and it notes that it does not accept recurrent policies. If I implement my own custom network into SAC then will it not run if I use an LSTM to encode the observation space before a forward pass to something like a MLP based critic or actor?
Basically just wondering if any use of an LSTM at all in a custom policy network would be supported or not. The docs are indicating that it's not, and I want to make sure: https://stable-baselines3.readthedocs.io/en/master/modules/sac.html#notes
Is that saying the SAC implementation won't support ANY recurrent neural networks or it doesn't support the available recurrent policies developed and available in the library.
Checklist