-
https://datawhalechina.github.io/easy-rl/#/chapter7/chapter7
Description
-
class M1(DQNConfig):
    backend = 'tf'
    env_type = 'detail'
    action_repeat = 1
class M2(DQNConfig):
    backend = 'tf'
    env_type = 'detail'
    action_repeat = 4
I use
python m…
-
Hi Denny,
Thanks for this wonderful resource. It's been hugely helpful. Can you say what your results are when training the DQN solution? I've been unable to reproduce the results of the DeepMind p…
-
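Hello, I would like to use data from human games to speed up convergence. For human games, how should the mcts_probs_batch probabilities be set during training? Could the probability of the action actually taken be set to 1 and all others to 0?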
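The one-hot target described in this question could be built roughly as follows. This is only a sketch of the idea being asked about, not a confirmed answer or the repository's own method; the helper `one_hot_move_probs`, the 9-move board size, and the sample move indices are hypothetical.

```python
def one_hot_move_probs(action, num_actions):
    """Return a probability vector with all mass on the move the human played."""
    if not 0 <= action < num_actions:
        raise ValueError("action index out of range")
    probs = [0.0] * num_actions
    probs[action] = 1.0  # the move actually taken gets probability 1
    return probs


# Hypothetical example: a 3x3 board (9 possible moves) and three recorded human moves.
human_moves = [4, 0, 8]
mcts_probs_batch = [one_hot_move_probs(a, 9) for a in human_moves]
```

Each row sums to 1, so it has the same shape as a search-derived probability vector, but it carries no information about alternative moves.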
-
Hi, I'm trying out this code on Windows. I always get this error: ERROR: (localhost:2000) failed to read data: timed out. This is the error trace.
runfile('C:/Users/cvaram/Documents/CARLA_0.9.…
-
### Search before asking
- [x] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues.
### Ray Component
Ray Core, Ray Tune
### What happened + What you ex…
-
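Hello, thank you very much for sharing the code. However, the models we trained perform poorly. Could you provide your trained model? We would like to do further testing. Thank you very much.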
-
Not sure if you are interested but I have written a tutorial for building a basic agent:
https://medium.com/@skjb/building-a-basic-pysc2-agent-b109cde1477c
https://medium.com/@skjb/building-a-smar…
-
## 🚀 Feature
The RL implementations added do not have the num_workers option. I have a feeling this is because the code doesn't support a shared replay buffer.
### Motivation
Adding this would e…
-
After `value, logit, (hx, cx) = model((Variable(state.unsqueeze(0)), (hx, cx)))` in train.py, the program hangs and does not continue. Do you have any idea why?