Check that the code implementation is accurate and reasonable - Githubissues

StepNeverStop / RLs

Reinforcement Learning Algorithms Based on PyTorch

https://stepneverstop.github.io

Apache License 2.0

448 stars 93 forks source link

Check that the code implementation is accurate and reasonable #34

Open StepNeverStop opened 3 years ago

StepNeverStop commented 3 years ago

[x] check and fix C51 [deaab73]
[x] check qrdqn [deaab73]
[ ] check iqn
[ ] check and fix Rainbow
[ ] check on-policy buffer sampling
[ ] check function discounted_sum
[ ] check function calculate_td_error
[ ] checke whether works well when training with visual input
[ ] fix TRPO that step_size sometime be nan
[ ] check vdn and qmix

StepNeverStop commented 3 years ago

[x] 检查将代码中关于运算维度的选择(dim/axis)把能设置为-1的都设置为-1。

StepNeverStop commented 3 years ago

[x] 校正RNN隐状态在使用探索策略时的迭代更新 abf6b0a
[x] 实现按策略与环境交互的间隔更新策略 abf6b0a