Getting wrapper warning during training and then training does not work

Stable-Baselines-Team / stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

https://sb3-contrib.readthedocs.io

MIT License

504 stars 175 forks source link

Getting wrapper warning during training and then training does not work #143

Closed gowthamnatarajan closed 1 year ago

gowthamnatarajan commented 1 year ago

Here is the code:

The training begins, and initially is picks the actions randomly as expected. But after a few seconds I get the following warning:

And then after this it always picks just one action over and over again and nothing happens. It no longer picks actions in random so it does not learn anything. What could be causing this? The same error happens even with the regular PPO algorithm in the sb3 package as well.

araffin commented 1 year ago

Hello, please fill the issue template completely (minimal working example and markdown code block are missing). This is the second time I'm asking you (https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/142#issuecomment-1412002247), if you continue to not follow basic rules to ask for help, i'll have to close any new issue without reading.

PS: i need to add custom gym env issue template in sb3 contrib, but can find/copy it from sb3