-
Hi @miyosuda, thanks for providing the code! When I experimented with it on games other than Pong (only the ROM name and ACTION_SIZE were modified), I found that A3C-FF does not seem to work very well. For example, a…
-
While trying the A3C example provided, I encountered the following error:
```
Training model
Training ACAgentRunner...
[2017-04-10 16:30:50,699] Making new env: CartPole-v0
Training ACAgentRunne…
```
-
Did anyone manage to get the A3C LSTM from this repo to work for Pong (using the OpenAI Gym)?
I have already tried several different optimizers, learning rates, and network architectures, but still no …
-
Hi,
Our goal is to minimize the loss. The loss consists of three parts:
- Value loss
- Policy loss
- Entropy (to encourage exploration)
As follows:
```
self.value_loss = 0.5 * tf.reduce_su…
```
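To make the three terms concrete, here is a minimal NumPy sketch of the same loss. The function name, the `1e-8` epsilon, and the `entropy_beta` weight are my own illustration, not the repo's exact code; only the general structure (squared-advantage value loss, `-log pi(a|s) * advantage` policy loss, minus an entropy bonus) follows the description above.

```python
import numpy as np

def a3c_loss(logits, actions, returns, values, entropy_beta=0.01):
    """Illustrative A3C loss: value loss + policy loss - entropy bonus."""
    # Softmax policy from raw logits (stabilized by subtracting the row max).
    probs = np.exp(logits - logits.max(axis=1, keepdims=True))
    probs /= probs.sum(axis=1, keepdims=True)
    log_probs = np.log(probs + 1e-8)

    # Advantage = n-step return minus the critic's value estimate.
    advantage = returns - values

    # Value loss: 0.5 * sum of squared advantages.
    value_loss = 0.5 * np.sum(advantage ** 2)

    # Policy loss: -log pi(a|s) * advantage (advantage treated as a constant).
    chosen_log_probs = log_probs[np.arange(len(actions)), actions]
    policy_loss = -np.sum(chosen_log_probs * advantage)

    # Entropy of the policy; subtracting it encourages exploration.
    entropy = -np.sum(probs * log_probs)

    return value_loss + policy_loss - entropy_beta * entropy
```

With a uniform policy and zero advantage, only the entropy term contributes, so the total loss is slightly negative.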
-
Hi,
Based on figure 7 of the [ViZDoom](http://www.cs.put.poznan.pl/wjaskowski/pub/papers/Kempka2016ViZDoom.pdf) paper, I tried to use a skip count to speed up training, as follows:
`r = self.env.ma…
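For reference, a frame-skip wrapper along these lines can be sketched as follows. The class and the `DummyEnv` usage below are my own illustration (the question's actual code is truncated); it only assumes a Gym-style `step`/`reset` API:

```python
class FrameSkip:
    """Repeat each action `skip` times and sum the intermediate rewards.

    Assumes the wrapped env follows the classic Gym API:
    step(action) -> (obs, reward, done, info).
    """

    def __init__(self, env, skip=4):
        self.env = env
        self.skip = skip

    def step(self, action):
        total_reward = 0.0
        obs, done, info = None, False, {}
        for _ in range(self.skip):
            obs, reward, done, info = self.env.step(action)
            total_reward += reward
            if done:  # stop repeating once the episode ends
                break
        return obs, total_reward, done, info

    def reset(self):
        return self.env.reset()
```

With `skip=4` the agent picks an action every 4th frame, which cuts the number of forward/backward passes per environment frame roughly fourfold.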
-
Guys, Keras-rl is the best reinforcement learning library.
It is easy to use despite the complexity of the RL algorithms.
Keras-rl is far better than Stable Baselines.
Please add PPO, A3C, and others, as DQN is …
-
Hello Morvan (莫凡), I have recently been using your A3C code and have some questions about it:
1. On line 150 of A3C_RNN.PY, `buffer_r.append((r+8)/8)` — why is the reward transformed this way?
2. On line 186, `GLOBAL_RUNNING_R.append(0.9 * GLOBAL_RUNNING_R[-1] + 0.1 * ep_r)` — why is the total reward used for display computed this way?
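For context on the second question: that line is an exponential moving average of episode returns, which smooths the curve that gets plotted. A minimal sketch of the same update (the function name is my own; the `0.9`/`0.1` weights match the line quoted above):

```python
def update_running_reward(history, ep_r, alpha=0.9):
    """Append an exponentially smoothed return for display purposes.

    new = alpha * previous_smoothed + (1 - alpha) * episode_reward
    The first episode seeds the history directly.
    """
    if not history:
        history.append(ep_r)
    else:
        history.append(alpha * history[-1] + (1 - alpha) * ep_r)
    return history
```

Because each episode only contributes 10% to the displayed value, a single lucky or unlucky episode barely moves the curve, making training progress easier to read.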
-
I want to run the extended library by frenkowski, but I'm having trouble installing the version suggested in this repository and can't fix it.
Is the problem with my Python version? What version …
-
I have some code for a stock-trading game that uses Deep Q-learning (just standard DQN with experience replay), but I would like to use A3C LSTM with experience replay as per the research paper …
-
Hello everybody.
After training 5 agents in the harvest environment using the A3C algorithm and, for example, a baseline method, how can I save a video of a test run using the saved model and checkpoints?
I tried…