-
Hello, I was trying to train a Hopper Gym agent with the rllab++ version of DDPG found in
`sandbox/rocky/tf/algos/ddpg.py`.
I initially ran the experiment as suggested by @shaneshixiang with the …
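For context, here is a minimal sketch of the usual rllab launch pattern, modeled on the classic Theano rllab DDPG example; the TF version in `sandbox/rocky/tf` has analogous classes, so treat these exact import paths and constructor arguments as assumptions:

```python
from rllab.algos.ddpg import DDPG
from rllab.envs.gym_env import GymEnv
from rllab.envs.normalized_env import normalize
from rllab.exploration_strategies.ou_strategy import OUStrategy
from rllab.policies.deterministic_mlp_policy import DeterministicMLPPolicy
from rllab.q_functions.continuous_mlp_q_function import ContinuousMLPQFunction

env = normalize(GymEnv("Hopper-v1"))

policy = DeterministicMLPPolicy(env_spec=env.spec, hidden_sizes=(32, 32))
es = OUStrategy(env_spec=env.spec)  # Ornstein-Uhlenbeck exploration noise
qf = ContinuousMLPQFunction(env_spec=env.spec)

algo = DDPG(
    env=env,
    policy=policy,
    es=es,
    qf=qf,
    batch_size=32,
    max_path_length=1000,
    epoch_length=1000,
    n_epochs=1000,
    discount=0.99,
)
algo.train()
```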
-
I am working on a reinforcement learning task that requires computing predictions a very large number of times. I have found that 56.87% of cumulative time is spent in the **_predict_loop** method. Also I have found t…
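For context, `_predict_loop` is the internal batching loop behind Keras's `model.predict`; a common mitigation is to batch the states and call `predict_on_batch` once instead of calling `predict` per sample. A minimal sketch (the model architecture here is purely illustrative):

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Dense

model = Sequential([Dense(64, activation='relu', input_dim=4),
                    Dense(2)])
model.compile(optimizer='adam', loss='mse')

states = np.random.rand(1000, 4).astype('float32')

# Slow: 1000 calls to predict(), each paying the full _predict_loop overhead.
q_slow = np.vstack([model.predict(s[None, :]) for s in states])

# Faster: a single batched call that skips the per-call loop machinery.
q_fast = model.predict_on_batch(states)
```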
-
Operating system: Ubuntu 16.04 x64
numpy version: '1.13.1'
python version: Python 2.7.13 |Continuum Analytics, Inc.| (default, Dec 20 2016, 23:09:15)
[GCC 4.4.7 20120313 (Red Hat 4.4.7-1)] on linu…
-
Hi,
Thanks for the great implementation. I am currently learning RL and trying to adapt paac to a simple CartPole use case. I made modifications to the paac code to include a new environment…
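As a point of comparison, here is a minimal sketch of how one might wrap CartPole behind an emulator-like interface; the method names are illustrative assumptions, not paac's actual environment API:

```python
import gym
import numpy as np

class CartPoleEmulator(object):
    """Hypothetical adapter exposing CartPole through an Atari-emulator-style
    interface (method names are illustrative, not paac's actual API)."""

    def __init__(self, seed=0):
        self.env = gym.make('CartPole-v0')
        self.env.seed(seed)
        self.num_actions = self.env.action_space.n

    def get_initial_state(self):
        # Reset the episode and return the first observation.
        return np.asarray(self.env.reset(), dtype=np.float32)

    def next(self, action):
        # Advance one step; return (observation, reward, episode_over).
        obs, reward, done, _ = self.env.step(int(action))
        return np.asarray(obs, dtype=np.float32), reward, done
```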
-
http://arxiv.org/pdf/1602.01783v1.pdf describes asynchronous methods for both off-policy (one-step and n-step Q-learning) and on-policy (Sarsa and advantage actor-critic, A3C) reinforcement learning.
T…
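The core of the n-step variants is the backward recursion for discounted returns, bootstrapped from the value estimate of the last state reached; a minimal sketch:

```python
import numpy as np

def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted n-step returns R_t = r_t + gamma * R_{t+1},
    seeded with V(s_{t+n}) as in the A3C paper's n-step updates."""
    returns = np.zeros(len(rewards))
    R = bootstrap_value
    for t in reversed(range(len(rewards))):
        R = rewards[t] + gamma * R
        returns[t] = R
    return returns
```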
-
Dear Danny
Thank you for the great work! I have two questions:
**1- Is it possible to change the “CliffWalk Actor Critic Solution.ipynb” code to implement Actor-Critic for Gym Atari games?**
I b…
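One practical difference is the state representation: CliffWalk uses a discrete state index, whereas Atari games emit RGB frames that are typically preprocessed before being fed to the actor-critic. An illustrative sketch of the common crop/downsample/grayscale step (not code from the notebook):

```python
import numpy as np

def preprocess_frame(frame):
    """Reduce a (210, 160, 3) Atari RGB frame to a small grayscale array:
    crop away the score area, downsample by 2, average the color channels."""
    cropped = frame[35:195]            # keep the play area
    downsampled = cropped[::2, ::2]    # now (80, 80, 3)
    return downsampled.mean(axis=2).astype(np.float32) / 255.0
```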
-
Hi Philip, I was wondering whether it's possible to manually set the emulator speed. It'd be nice to further increase the speed, say to 5000%, during training. Additionally, when demoing the RL agent,…
-
Setting up openai/universe, I used the "universe starter agent" as a smoke test.
After adjusting the number of workers to better utilize my CPU, I saw the default PongDeterministic-v3 start winnin…
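For reference, a small sketch of how one might derive the worker count from the machine's core count before launching the starter agent (the `--num-workers` and `--env-id` flags are from the starter agent's README; leaving one core free is just a rule of thumb):

```python
import multiprocessing

# One A3C worker per core, minus one core for the parameter server and OS.
num_workers = max(1, multiprocessing.cpu_count() - 1)
print('python train.py --num-workers %d --env-id PongDeterministic-v3'
      % num_workers)
```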
-
Hi,
A non-technical question, I hope it's OK to ask here on GitHub...
I am working on continuous robot control problems and was wondering which approach you are following for the continuous branch. …
-
Implement an algorithm that learns a common baseline for Q-values:
http://arxiv.org/pdf/1301.2315.pdf
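One natural parameterization of a shared baseline decomposes Q into a state value plus centered advantages; whether this matches the linked paper's exact formulation is an assumption (the same idea was later popularized by dueling architectures):

```python
import numpy as np

def q_with_common_baseline(v, advantages):
    """Q(s, a) = V(s) + (A(s, a) - mean_a A(s, a)).
    Centering the advantages makes V identifiable as the common baseline."""
    centered = advantages - advantages.mean(axis=-1, keepdims=True)
    return v[..., None] + centered

# Example: batch of 2 states, 3 actions each.
v = np.array([1.0, -0.5])
adv = np.array([[0.2, 0.0, -0.2],
                [1.0, 0.5, 0.0]])
print(q_with_common_baseline(v, adv))
```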