cormac020 / Branching-DQN

I created this repository for version control. So far it seems to perform better than the other Branching DQN implementations I have seen.

Branching DQN

A Branching DQN implementation in PyTorch, based on https://github.com/seolhokim/BipedalWalker-BranchingDQN. It shows (almost) optimal movements after 1000 episodes in the BipedalWalker-v3 environment. For better performance in BipedalWalker-v3, I use some tricks mentioned in https://zhuanlan.zhihu.com/p/409553262. However, it seems to work fine in other environments without these tricks. :)
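For readers new to the idea, here is a minimal sketch of how a branching Q-function turns network outputs into a continuous action. It is not taken from train.py. It assumes the usual Branching DQN setup: one shared state value, one advantage vector per action dimension ("branch"), and each dimension discretized into evenly spaced bins. BipedalWalker-v3 has four action dimensions in [-1, 1]; the bin count of 3, the numbers, and all function names below are hypothetical. Plain Python lists stand in for tensors so the sketch runs without PyTorch.

```python
from statistics import mean


def branch_q_values(state_value, branch_advantages):
    """Combine a shared state value with per-branch advantages.

    For each branch d: Q_d(s, a) = V(s) + A_d(s, a) - mean over a' of A_d(s, a')
    """
    return [
        [state_value + adv - mean(advs) for adv in advs]
        for advs in branch_advantages
    ]


def greedy_bins(q_values):
    """Pick the arg-max bin independently in every branch."""
    return [max(range(len(q)), key=q.__getitem__) for q in q_values]


def bins_to_action(bins, n_bins, low=-1.0, high=1.0):
    """Map bin indices to evenly spaced values in [low, high]."""
    step = (high - low) / (n_bins - 1)
    return [low + b * step for b in bins]


# Hypothetical network outputs for one state:
# 4 branches (one per action dimension), 3 bins each.
value = 0.5
advantages = [
    [0.1, 0.4, 0.0],
    [0.3, 0.2, 0.1],
    [0.0, 0.0, 0.9],
    [0.7, 0.1, 0.2],
]

q = branch_q_values(value, advantages)
bins = greedy_bins(q)                    # one discrete choice per branch
action = bins_to_action(bins, n_bins=3)  # continuous action for the env
print(bins, action)
```

Because each branch picks its own bin, the number of network outputs grows linearly with the number of action dimensions (4 x 3 here) instead of exponentially (3^4 joint actions).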

Dependencies

python==3.9.10
gym==0.18.3
torch==1.13.1
Other versions may also work; the ones above are listed only for reference.
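If you want to compare your environment against the reference versions, something like the following could help. It is only a convenience sketch and not part of this repository; the helper names are hypothetical, and it relies only on the standard library's importlib.metadata.

```python
import sys
from importlib import metadata

# Reference versions copied from the list above.
REFERENCE = {"gym": "0.18.3", "torch": "1.13.1"}


def installed_versions(packages):
    """Return {name: installed version, or None if the package is missing}."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def report(reference=REFERENCE):
    """Build one line per dependency: installed version vs. reference version."""
    lines = [
        "python: installed {}.{}.{}, reference 3.9.10".format(*sys.version_info[:3])
    ]
    for name, have in installed_versions(reference).items():
        lines.append(
            f"{name}: installed {have or 'missing'}, reference {reference[name]}"
        )
    return lines


if __name__ == "__main__":
    print("\n".join(report()))
```

A mismatch is not necessarily a problem; it just tells you where to look first if training or testing fails.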

Structure

/data: contains results of training or testing, including graphs and videos
/model: contains pre-trained models

Train

use:

python train.py

Test

use:

python enjoy.py

Performance

Scores in training:
[Figure: score in 2000 episodes]
Trained model:
[Figure: visual performance]