-
Hi,
For some reason, when I run the trained A2C agent in CARLA, it doesn't take any actions; it just sits there doing nothing. These are my terminal outputs:
(rl_gym_book) amakri@amakri-Zephy…
-
1. The critic network is used to compute V. Its loss function should, like DQN's, iteratively fit V towards the true value, and then the advantage is obtained as A = Q - V. But your loss function minimizes the advantage. How can you minimize the advantage? Our goal is to increase the advantage function as much as possible.
```
# critic
with tf.variable_scope('critic'):
    l1 = tf.la…
```
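To clarify the confusion above: the advantage itself is not minimized. The critic's loss minimizes the value-estimation error (fitting V towards a bootstrapped target, much like DQN's TD fit), while the actor's loss is -log π(a|s) · A, so minimizing it performs gradient ascent on the expected advantage. A minimal PyTorch sketch of this standard A2C loss pair (an illustration, not the exact code under discussion):

```python
import torch

def a2c_losses(log_prob, value, td_target):
    """Standard A2C losses (an illustration, not the repo's exact code).

    log_prob : log pi(a_t | s_t) for the action the actor took
    value    : V(s_t) predicted by the critic
    td_target: bootstrapped return, treated as a constant target
    """
    advantage = td_target.detach() - value       # A = Q - V, with Q estimated by the return
    critic_loss = advantage.pow(2).mean()        # fit V towards the target, like DQN's TD loss
    # Minimizing -log_prob * A is gradient ASCENT on expected advantage;
    # detach() keeps actor gradients out of the critic.
    actor_loss = -(log_prob * advantage.detach()).mean()
    return actor_loss, critic_loss

# Example shapes: one batch of 4 transitions
log_prob = torch.randn(4, requires_grad=True)
value = torch.randn(4, requires_grad=True)
td_target = torch.randn(4)
actor_loss, critic_loss = a2c_losses(log_prob, value, td_target)
(actor_loss + 0.5 * critic_loss).backward()      # typical combined update
```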
-
Hi,
I have a question about the plots presented in `Ch8`, in the **"Training and testing the deep n-step advantage actor-critic agent"** section of the book.
The TensorBoard plots in this se…
-
Hi, today I studied a2c_agent.py, the actor-critic implementation. I tested it in several simple environments, and I found that this implementation needs millions of steps to reach the optimal policy. …
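For reference, the n-step bootstrapped target that an n-step advantage actor-critic regresses its critic towards can be sketched as below (a minimal sketch, not the exact a2c_agent.py code); larger n propagates reward information further per update, which is one factor in how many environment steps training takes:

```python
def n_step_return(rewards, bootstrap_value, gamma=0.99):
    """n-step bootstrapped target (a sketch, not the exact a2c_agent.py code):
    G = r_0 + gamma*r_1 + ... + gamma^(n-1)*r_{n-1} + gamma^n * V(s_n)."""
    g = bootstrap_value
    for r in reversed(rewards):   # fold rewards back from the bootstrap state
        g = r + gamma * g
    return g

# e.g. a 3-step target with V(s_3) = 0.5:
print(n_step_return([1.0, 0.0, 1.0], 0.5))
```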
-
@praveen-palanisamy thank you for your extremely helpful code base. I have some questions that I hope you can give me some insight into:
- I noticed that the reward function is the one that was int…
-
From the end of section 3 in the GAE paper: **High-Dimensional Continuous Control Using Generalized Advantage Estimation**
https://arxiv.org/pdf/1506.02438.pdf
```
Taking γ < 1 introduces bias in…
```
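The quoted passage concerns the estimator in which γ < 1 introduces bias even before λ enters. A minimal NumPy sketch of the GAE recursion from the paper, Â_t = δ_t + γλ Â_{t+1} with δ_t = r_t + γV(s_{t+1}) - V(s_t) (an illustration, with assumed array inputs):

```python
import numpy as np

def gae(rewards, values, gamma=0.99, lam=0.95):
    """GAE advantages (a sketch of the paper's estimator, assumed array inputs).

    rewards: r_0 .. r_{T-1};  values: V(s_0) .. V(s_T), one extra bootstrap entry.
    gamma < 1 biases the estimate even when lam = 1, as the quoted passage notes;
    lam < 1 trades additional bias for lower variance.
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    values = np.asarray(values, dtype=np.float64)
    deltas = rewards + gamma * values[1:] - values[:-1]   # TD residuals delta_t
    advantages = np.zeros(len(rewards))
    running = 0.0
    for t in reversed(range(len(rewards))):               # A_t = delta_t + gamma*lam*A_{t+1}
        running = deltas[t] + gamma * lam * running
        advantages[t] = running
    return advantages
```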
-
I was going through your code and noticed this:
![random action](https://user-images.githubusercontent.com/11025093/43618534-df4a1da2-9703-11e8-9ac3-fb0cb9589359.png)
Why are you using np.random…
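The exact call is truncated above, but a common pattern in such agents is to sample the action from the categorical distribution given by the actor's output probabilities, which is stochastic-policy sampling rather than a uniformly random action. A hypothetical illustration (the probabilities and the np.random.choice call are assumptions, not the screenshot's exact code):

```python
import numpy as np

action_probs = np.array([0.1, 0.7, 0.2])   # pi(a|s) from the actor's softmax
action = np.random.choice(len(action_probs), p=action_probs)
print(action)  # a stochastic sample that follows the policy, not a uniform pick
```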
-
Hello, thank you for this great work. I have a few questions, as I am a newbie in Reinforcement Learning.
Would it be possible to use a single network with multiple heads rather than two networks? I am actually t…
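A single network with a shared trunk and separate actor and critic heads is a common alternative to two networks. A minimal PyTorch sketch (hypothetical layer sizes; not the book's exact architecture):

```python
import torch
import torch.nn as nn

class SharedActorCritic(nn.Module):
    """One network, two heads (hypothetical sizes; not the book's exact model)."""
    def __init__(self, obs_dim, n_actions, hidden=128):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.actor_head = nn.Linear(hidden, n_actions)   # policy logits
        self.critic_head = nn.Linear(hidden, 1)          # state value V(s)

    def forward(self, obs):
        h = self.trunk(obs)                              # shared features
        return self.actor_head(h), self.critic_head(h)

logits, value = SharedActorCritic(obs_dim=8, n_actions=4)(torch.randn(1, 8))
```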
-
Hello @praveen-palanisamy
I went through your book and it's really helpful.
I have a question regarding "Using TensorBoard for logging and visualizing a PyTorch RL agent's progress" p. 107 (chap …
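For readers following that section, the basic logging pattern looks like the sketch below (assuming tensorboardX's SummaryWriter and a hypothetical log directory; the scalar values here are stand-ins):

```python
from tensorboardX import SummaryWriter

writer = SummaryWriter("logs/a2c_run")         # hypothetical log directory
for step in range(100):
    episode_reward = float(step)               # stand-in for the agent's real reward
    writer.add_scalar("reward/episode_reward", episode_reward, step)
writer.close()
# View with: tensorboard --logdir=logs  (then open http://localhost:6006)
```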