sac 在LunarLander-v2上的使用有问题，

StepNeverStop / RLs

Reinforcement Learning Algorithms Based on PyTorch

https://stepneverstop.github.io

Apache License 2.0

449 stars 93 forks source link

Closed wkhawyha closed 3 years ago

StepNeverStop commented 3 years ago

This problem is correlated with gradient backward and I've fixed that at commit ab5fffc

StepNeverStop commented 3 years ago

feel free to reopen this.