StepNeverStop / RLs

Reinforcement Learning Algorithms Based on PyTorch
https://stepneverstop.github.io
Apache License 2.0
449 stars 93 forks source link

sac 在LunarLander-v2上的使用有问题, #44

Closed wkhawyha closed 3 years ago

StepNeverStop commented 3 years ago

This problem is correlated with gradient backward and I've fixed that at commit ab5fffc

StepNeverStop commented 3 years ago

feel free to reopen this.