sebtheiler / tutorials

All of the code for my Medium articles
https://medium.com/@sebastiankt9
MIT License
134 stars 107 forks source link

looks like target-network and online-network gets updated at the same frequency UPDATE_FREQ #6

Open anirbanchowdhury9862 opened 4 years ago

anirbanchowdhury9862 commented 4 years ago

looks like target-network and online-network gets updated at the same frequency UPDATE_FREQ

sebtheiler commented 4 years ago

Thank you for spotting this! I've just made the fix in a local file, and am currently testing it, with the target-network being updated every 10,000 steps. I'll get back to you when it's done.

sebtheiler commented 4 years ago

I apologize that this has taken me so long, the past few months have been hectic. I've created a new branch dqn-fix with the changes. I haven't had time to thoroughly test the results and make sure the DQN obtains a similar score, but the code is there, and from my testing so far, it's working as expected. I'll be away from my desktop for a month or so and won't be able to do any more testing for the moment, which is why I'm uploading the files now. I'll leave this issue open until I can test the code to completion.

anirbanchowdhury9862 commented 4 years ago

Great.