-
Hi!
Let's bring the reinforcement learning course to all the Russian-speaking community 🌏
Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…
-
Hi, thank you very much for sharing the code. It is very helpful.
I have a question about the meaning of the constant "3". In many places in the code, "3" is used directly to define the parameters. s…
-
After the release of TensorFlow 2.0, several enhancements have been made to both versions. Some functions were removed from 1.x, and some are deprecated and replaced in TensorFlow 2…
-
# Deep Q-Network (DQN) on LunarLander-v2 | Chan`s Jupyter
In this post, we will work through a hands-on lab on a simple Deep Q-Network (DQN) in the OpenAI LunarLander-v2 environment. This is the coding exercise fr…
-
During the literature review for my Master's thesis, I came across your paper on Deep Q-Learning based DTC of PMSM. My idea was to work on a motor control algorithm using the GEM library. I am trying …
-
Hi!
Let's bring the reinforcement learning course to all the Korean-speaking community 🌏 (currently 9 out of 77 complete)
Would you want to translate? Please follow the 🤗 [TRANSLATING guide](ht…
-
Pose a question about one of the following articles:
“[Human-level control through deep reinforcement learning](https://www.nature.com/articles/nature14236)” 2015. V. Mnih...D. Hassabis. Nature 51…
-
Post your questions here about: “Reinforcement Learning” and “Deep Reinforcement Learning”, Thinking with Deep Learning, Chapters 15 & 16
-
I see that you are using a 0 vector for the rewards, and only updating the value that corresponds to the action here:
https://github.com/AxiomaticUncertainty/Deep-Q-Learning-for-Tic-Tac-Toe/blob/c5c0…
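For context, here is a minimal sketch of the two target constructions this kind of question usually contrasts. It is not taken from the linked repository: the function names, the three-action example and the numbers are all hypothetical. With a zero-vector target, a squared-error loss also pulls the Q-values of the actions that were not taken toward 0. Copying the network's current predictions and overwriting only the taken action leaves those other entries with zero error.

```python
import numpy as np

def zero_vector_target(n_actions, action, value):
    """Target that is zero everywhere except at the taken action (hypothetical helper)."""
    target = np.zeros(n_actions, dtype=np.float32)
    target[action] = value
    return target

def masked_target(q_values, action, value):
    """Copy the current predictions and overwrite only the taken action (hypothetical helper)."""
    target = np.array(q_values, dtype=np.float32, copy=True)
    target[action] = value
    return target

# Hypothetical network output for a state with three actions.
q_values = np.array([0.2, -0.1, 0.5], dtype=np.float32)

a = zero_vector_target(n_actions=3, action=1, value=1.0)
b = masked_target(q_values, action=1, value=1.0)

# Per-action error that a squared-error loss would see for each target.
err_zero = a - q_values      # non-zero for every action
err_masked = b - q_values    # non-zero only for the taken action
```

Under these assumptions, `err_masked` is zero for the untaken actions, so only the taken action's estimate receives a gradient, whereas `err_zero` also pushes the other two estimates toward 0.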
-
[paper](https://arxiv.org/pdf/1502.05477.pdf)
## TL;DR
- **I read this because.. :** CS285 final project
- **task :** reinforcement learning
- **problem :** Is there a policy update method that is theoretically guaranteed to always improve performance…