cormac020 / Branching-DQN

I created this for version control and it seems so far to perform better than other branching DQN I have seen
5 stars 1 forks source link

This code seems can not solve the problem when different actions have different scale #1

Open ChenYangZuo opened 2 months ago

cormac020 commented 2 months ago

Thanks for your issue! The agent is mainly trained and tested in the gym environments, especially the continuous control tasks. These environments provide standard interfaces so I just divided the continuous actions into discrete actions with equal length for convenience. I believe this algorithm can adapt to tasks where different actions have different scales, with some slight adjustments in the codes. I'll consider updating my codes later. Thanks again!