a3c Search Results - Githubissues

1000+ results
for a3c

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

rlcode/reinforcement-learning #101

5_A3C Cartpole Script - AttributeError: 'Functional' object …

References a function that doesn't exist.

windowshopr updated 3 years ago
4
rlcode/reinforcement-learning-kr #13

load_weights - 모델의 predict()의 액션값이 항상 같음

안녕하세요. 'reinforcement-learning-kr/3-atari/1-breakout/breakout_a3c.py' 코드에서 매우 높은 학습 결과(성능)을 나타내는 모델의 weights 저장한 후 다시 불러와서 play할 때에, 마치 처음 학습하는 것과 같이 항상 같은 값(액션)을 계속 리턴합니다. 비슷한 문제가 다루어 지는지 몇몇 이…

peterkiminno updated 6 years ago
2
ugo-nama-kun/gym_torcs #50

How to get image observation as an RGB value array?

We are trying to get an observation output of the vision image for each step in order to write an A3C algorithm with tensorflow that will be able to learn from vision. We have made sure that the re…

JJMUSA updated 2 years ago
5
zackchase/mxnet-the-straight-dope #362

Training on multiple processes

Is there any examples / tutorials on how to use the tools/launch.py that supports training on different processes, either on different machines or on a single machine? The documentation touched ve…

dai-dao updated 6 years ago
1
shamanez/Target-Driven-Visual-Navigation-A3C-USF-LSTM #2

Where can I find the code of the paper?

thank you very much.

YhsCandy updated 3 years ago
4
clab/dynet #1271

Functions which can directly set gradient to Parameters or L…

To implement the reinforcement learning algorithms like A3C, directly setting the gradient of `Parameters` and `LookupParameters` will be necessary. e.g. `Parameters.set_grad(self, array)` Further,…

speedcell4 updated 6 years ago
4
ikostrikov/pytorch-a3c #55

how to under ensure ensure_shared_grads?

I am kind of confused of the ensure_shared_grads here https://github.com/ikostrikov/pytorch-a3c/blob/master/train.py#L13. Here, the `grad` is synced only when it is `None`. I think we need to set `sha…

luochao1024 updated 5 years ago
1
muupan/async-rl #2

Not sample efficient enough

From Figure 6 in the paper, their A3C only needs 20 epochs (20 million steps) to achieve average scores of around 400 at Breakout. My current implementation needs more.

muupan updated 8 years ago
4
eugenevinitsky/sequential_social_dilemma_games #156

Error running visual lizer_rllib.py

Hello, I am very interested in your article, but I encountered the following errors in code execution. I hope I can get your guidance. Errors occurred: OMP: Info#212: KMP_AFFINITY: decoding x 2 …

liuxgff updated 5 years ago
7
ah-ryeong/Ahgorithm #8

[JavaScript] 모의고사

**문제** 수포자는 수학을 포기한 사람의 준말입니다. 수포자 삼인방은 모의고사에 수학 문제를 전부 찍으려 합니다. 수포자는 1번 문제부터 마지막 문제까지 다음과 같이 찍습니다. 1번 수포자가 찍는 방식: 1, 2, 3, 4, 5, 1, 2, 3, 4, 5, ... 2번 수포자가 찍는 방식: 2, 1, 2, 3, 2, 4, 2, 5, 2, 1,…

ah-ryeong updated 1 year ago
2

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for a3c

1000+ results
for a3c