-
I've trained the model for 50 total episodes. However, when I run the last code cell, the action is always the same. I've printed Qs and the action, and the action is always [0 0 0 0 0 0 1 0]. The age…
-
Saves the Q-values learned by our model, then plots them with matplotlib to see how the behaviour changes when the values are tweaked.
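
One way the save-then-plot step could look, as a rough sketch. It assumes the Q-values are a table with one row per state and one column per action, which the description does not confirm. The function names and the JSON file format are hypothetical choices, not taken from the original code.

```python
import json
from pathlib import Path


def save_q_values(q_table, path):
    """Write a Q-table (a list of rows, one row per state) to a JSON file."""
    Path(path).write_text(json.dumps(q_table))


def load_q_values(path):
    """Read back a Q-table written by save_q_values."""
    return json.loads(Path(path).read_text())


def plot_q_values(q_table, out_path="q_values.png"):
    """Draw the Q-table as a heatmap: actions on the x-axis, states on the y-axis."""
    # Imported here so that saving and loading work without matplotlib installed.
    import matplotlib
    matplotlib.use("Agg")  # render straight to a file, no display needed
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    image = ax.imshow(q_table, aspect="auto")
    ax.set_xlabel("action")
    ax.set_ylabel("state")
    fig.colorbar(image, ax=ax, label="Q-value")
    fig.savefig(out_path)
    plt.close(fig)
```

Saving one file per hyperparameter setting and plotting each with `plot_q_values` would give heatmaps that can be compared side by side.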
-
I am working through your TSP tutorial and have run into a small problem. Could you please help?
Code block 1:
for idx in range(K):
    # Train with Q-learning
    rewards, Q_q = train(env, qpolicy, total_episodes)
    # rewards, Q_q = train(env, on_policy, total_episodes)
    qlear…
-
I found a bug in the Q-learning code: it runs, but sometimes it just stops partway through and raises a key error.
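
If the Q-table here is a plain Python `dict` keyed by state (an assumption, since the report does not show the code), a lookup on a state that has never been visited is a common cause of an intermittent key error. A minimal sketch of one way to avoid it, using `collections.defaultdict`; `make_q_table` and `q_update` are hypothetical names, not functions from the code being reported:

```python
from collections import defaultdict


def make_q_table(n_actions):
    """Q-table that creates an all-zero row the first time a state is looked up."""
    return defaultdict(lambda: [0.0] * n_actions)


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """One tabular Q-learning update; unseen states never raise KeyError."""
    best_next = max(q[next_state])  # creates the row for next_state if it is missing
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


q = make_q_table(n_actions=2)
q_update(q, state="a", action=1, reward=1.0, next_state="never seen before")
```

With a plain `dict`, the `q[next_state]` lookup in that last call would fail, and only on runs that happen to reach a new state, which would match the "sometimes" in the report.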
-
Hi Haarnoja,
Thanks a lot for maintaining the amazing repo!
I am a little confused about the implementation of SVGD in soft Q-learning.
At
https://github.com/rail-berkeley/softlearning/blob/0…
-
Hi,
Would it be possible to extend the Q-learning implementation to use a function approximator instead of a state-action table, and to include a sample application?
This would be much more useful for r…
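
To make the request concrete, here is a rough sketch of what replacing the table with a function approximator could look like in the simplest case: a linear Q-function with one weight vector per action, updated with the semi-gradient Q-learning rule. Everything here (`q_value`, `q_learning_step`, the feature vectors) is hypothetical and not part of the existing implementation.

```python
def q_value(weights, features, action):
    """Linear approximation: Q(s, a) is the dot product of weights[a] and the features of s."""
    return sum(w * x for w, x in zip(weights[action], features))


def q_learning_step(weights, features, action, reward, next_features, done,
                    alpha=0.1, gamma=0.99):
    """Apply one semi-gradient Q-learning update in place and return the TD error."""
    target = reward
    if not done:
        # Bootstrap from the best action value in the next state.
        target += gamma * max(
            q_value(weights, next_features, a) for a in range(len(weights))
        )
    td_error = target - q_value(weights, features, action)
    # For a linear model the gradient of Q(s, a) w.r.t. weights[a] is the feature vector.
    weights[action] = [
        w + alpha * td_error * x for w, x in zip(weights[action], features)
    ]
    return td_error


weights = [[0.0, 0.0], [0.0, 0.0]]  # two actions, two features
td = q_learning_step(weights, [1.0, 0.0], action=1, reward=1.0,
                     next_features=[0.0, 1.0], done=True)
```

With one-hot state features this reduces to the tabular update, so the table becomes a special case of the approximator.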
-
**This is a(n):**
- [x] New algorithm
- [ ] Update to an existing algorithm
- [ ] Error
- [ ] Proposal to the Repository
**Details:**
[Q Learning](https://en.wikipedia.org/wiki/Q…
-
https://arxiv.org/abs/1611.01626
-
# Q-learning for beginners | Maxime Labonne
Train an AI to solve the Frozen Lake environment
[https://mlabonne.github.io/blog/reinforcement%20learning/q-learning/frozen%20lake/gym/tutorial/2022/02/1…
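
For orientation, tabular Q-learning on a Frozen Lake style grid can be sketched without any dependencies. This is not the tutorial's code. It assumes a hand-written 4x4 map and deterministic moves (no slipping), and it explores uniformly at random during training, which works because Q-learning is off-policy. The names `step`, `train` and `greedy_rollout` are made up for this example.

```python
import random

# Hand-written 4x4 map: S start, F frozen, H hole, G goal.
LAKE = ["SFFF", "FHFH", "FFFH", "HFFG"]
SIZE = len(LAKE)
MOVES = [(0, -1), (1, 0), (0, 1), (-1, 0)]  # left, down, right, up


def step(state, action):
    """Move deterministically (no slipping); return (next_state, reward, done)."""
    row, col = divmod(state, SIZE)
    d_row, d_col = MOVES[action]
    row = min(max(row + d_row, 0), SIZE - 1)  # walls keep the agent on the grid
    col = min(max(col + d_col, 0), SIZE - 1)
    tile = LAKE[row][col]
    return row * SIZE + col, float(tile == "G"), tile in "HG"


def train(episodes=5000, alpha=0.5, gamma=0.9, max_steps=100, seed=0):
    """Learn a Q-table from uniformly random exploration."""
    rng = random.Random(seed)
    q = [[0.0] * len(MOVES) for _ in range(SIZE * SIZE)]
    for _episode in range(episodes):
        state = 0
        for _step in range(max_steps):
            action = rng.randrange(len(MOVES))
            next_state, reward, done = step(state, action)
            # Terminal transitions do not bootstrap from the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            if done:
                break
            state = next_state
    return q


def greedy_rollout(q, max_steps=100):
    """Follow the highest-valued action from the start; return (path, final reward)."""
    state, path = 0, [0]
    for _step in range(max_steps):
        action = max(range(len(MOVES)), key=lambda a: q[state][a])
        state, reward, done = step(state, action)
        path.append(state)
        if done:
            return path, reward
    return path, 0.0
```

Calling `greedy_rollout(train())` returns the states visited by the learned greedy policy together with the final reward, which is 1.0 when the goal is reached.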