-
I have tried running the PPO, DDPG, and VPG examples on CarRacing-v0 and consistently receive the same ValueError:
ValueError: Can not squeeze dim[1], expected a dimension of 1, got 96 for 'v/Squeeze'…
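For context, this class of error typically means the raw 96x96 image observation reaches a value head that expects a scalar per sample. A minimal NumPy illustration of the same failure mode (`tf.squeeze` behaves analogously; the shapes here are my assumption about what reaches the squeeze op, not taken from the repo):

```python
import numpy as np

# CarRacing-v0 observations are 96x96 RGB images.
obs_batch = np.zeros((1, 96, 96, 3))

# A value head squeezing axis 1 expects shape (batch, 1) -- one scalar
# value per sample -- not a raw image observation.
try:
    np.squeeze(obs_batch, axis=1)  # axis 1 has size 96, not 1
except ValueError as e:
    print("squeeze failed:", e)

# After the observation is encoded down to a (batch, 1) value prediction,
# the squeeze succeeds and yields shape (batch,).
value = np.zeros((1, 1))
squeezed = np.squeeze(value, axis=1)
print(squeezed.shape)
```

If that matches what you are seeing, the fix is usually to make sure the image passes through the policy/value network (e.g. a conv encoder) before the scalar value is squeezed.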
-
If I take the DDPG example and add a dropout layer to either the actor or the critic model, I get an AssertionError from Theano (but not from TensorFlow):
```
Traceback (most recent call last):
File "dd…
```
-
I am running the examples on my Ubuntu machine with an Intel® Core i7-4770K CPU @ 3.50GHz (4 cores). During the entire training, only ~25% of the CPU is used, which suggests it is running on only one core. Am…
-
1. [Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards](https://arxiv.org/pdf/1707.08817.pdf)
-
Hi,
I have a question about the plots presented in `Ch8`, in the section **"Training and testing the deep n-step advantage actor-critic agent"** of the book.
The TensorBoard plots in this se…
-
Hi @normandipalo, amazing implementation of PPO.
I tried running the code for 10,000 episodes. In the end, the robot acquires a behaviour of moving the block randomly, which is intuitive as it is trained…
-
Hi there, thanks for sharing your code -- it's been very helpful!
One question: is your implementation of the A2C a 'genuine' actor-critic method? My (limited) understanding was that to qualify as …
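As a reference point for the question: an actor-critic usually bootstraps from a learned value function rather than using full Monte Carlo returns. A minimal NumPy sketch of the n-step advantage estimate (variable and function names are mine, not from this repo):

```python
import numpy as np

def n_step_advantage(rewards, values, bootstrap_value, gamma=0.99):
    """n-step advantage: (r_0 + g*r_1 + ... + g^n * V(s_n)) - V(s_0).

    rewards:          rewards r_0 .. r_{n-1} collected over n steps
    values:           critic estimates V(s_0) .. V(s_{n-1})
    bootstrap_value:  V(s_n), the critic's estimate at the final state
    """
    ret = bootstrap_value
    for r in reversed(rewards):
        ret = r + gamma * ret
    return ret - values[0]

adv = n_step_advantage([1.0, 0.0, 1.0], [0.5, 0.4, 0.3], 0.2, gamma=0.9)
print(round(adv, 4))  # 1.4558
```

The bootstrap term `V(s_n)` is what distinguishes an actor-critic from a pure REINFORCE-with-baseline setup, where the full return is used and the value function only reduces variance.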
-
Hi, the DPG critic update (see Algorithm 1 of Lillicrap et al. 2016, https://arxiv.org/abs/1509.02971) is substantively the same as your td_learning function; however, this is currently obscured. I wo…
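To make the correspondence concrete: the critic update in Algorithm 1 is one-step TD learning, with target y_i = r_i + gamma * Q'(s_{i+1}, mu'(s_{i+1})) and a squared-error loss against Q(s_i, a_i). A hedged NumPy sketch (function names are mine, not the repo's `td_learning`):

```python
import numpy as np

def td_target(reward, next_q, done, gamma=0.99):
    """One-step TD target: y = r + gamma * Q'(s', mu'(s')), 0 bootstrap if terminal."""
    return reward + gamma * next_q * (1.0 - done)

def critic_loss(q, reward, next_q, done, gamma=0.99):
    """Mean squared TD error, as in the DDPG critic update."""
    y = td_target(reward, next_q, done, gamma)
    return np.mean((y - q) ** 2)

q      = np.array([1.0, 2.0])   # critic Q(s, a) on a batch of transitions
reward = np.array([0.5, 1.0])
next_q = np.array([1.0, 0.0])   # target critic on target-actor actions
done   = np.array([0.0, 1.0])   # second transition is terminal
print(critic_loss(q, reward, next_q, done, gamma=0.9))  # 0.58
```

Seen this way, the DDPG-specific part is only where `next_q` comes from (target networks and the deterministic target actor); the update rule itself is plain TD(0).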
-
# Next paper candidates
Let's propose papers to study next! All papers mentioned in the comments of this issue will be listed in the next vote.