Closed ErikPasztor closed 3 years ago
I tried to follow the example code for HER, but I feel like I'm missing something. I tried using buffer_size=env_max_steps+1, but that didn't help. Can I fix this? Or is this a bug?
Hello,
it seems that you are using a custom environment, I would recommend you to fill up the custom env template ;)
I would also recommend you to use the TimeLimit
wrapper from gym to avoid any issue regarding the timeout.
Finally, make sure to the use the master version of SB3 (cf doc), we recently fixed a bug in https://github.com/DLR-RM/stable-baselines3/issues/234
Hi, using TimeLimit wrapper did help. I ended up specifying the maximum number of steps for each wrapper, but I got it to work. Thank you for your help.
Question
I'm trying to teach a custom environment using TD3+HER (relevant code below). model.learn() produces an error
I tried to follow the example code for HER, but I feel like I'm missing something. I tried using buffer_size=env_max_steps+1, but that didn't help. Can I fix this? Or is this a bug?
Code
Checklist