-
_migrated from Trac, where originally posted by **clange** on 29-Aug-2010 11:56am_
In the following example for parallel markup/literate programming, we have a term (definiendum), which is defined by…
-
别人很多A2C是一步一更新,你这个是每一轮过后总的更新一次
-
Hi,
Support for MultiDiscrete and MultiContinuous action spaces is not built for A2C implementation.
Does anyone know which algorithm can help me with large discrete action spaces?
Thanks!
-
hI~,all:
WHEN testing ,we can set the fps by [--fps FRAMERATE] in command line,
but now i want to change the fps when training, how ?
I tried modify the vizdoom_wrapper.py and change the fps from …
-
보통 A3C같은 알고리즘들은 게임용으로 제작됐는데요,
Policy gradient말고 다른 알고리즘 적용시 policy_learner, policy_network 부분만 수정해주면 될까요?
어떤식으로 적용할지 너무 어렵네요
-
여유가 많지 않으므로 일주일 이내에 보는 것이 목표
ghost updated
5 years ago
-
I'm training a A3C these days, but the NN always take the same action, after some steps.
The game I train for is similar to playing Go. There will be few reward in the short term. So it hard to learn…
-
I'm using the following code in my terminal:
python train.py --action-space complex --environment SuperMarioBros-1-1-v0 --transfer
using the SuperMarioBros-1-1-v0.dat file provided by you in the…
-
I suggest you use CNN+LSTM Model to instead of old model
-
你好,请问一下这个代码有相关的论文吗,采用的是哪一种深度学习算法。