a3c-lstm Search Results

160 results
for a3c-lstm

Best match


minosworld/minos #48

benchmark code of four navigation algorithms

Hi, will the benchmark code for the four navigation algorithms in the paper be released? Also, how long does it take to train the agents? My English is poor, so I have some confusion about the followin…

nina124 updated 6 years ago
6
NVlabs/GA3C #3

LSTM version

It is great work. Is there any plan to develop an LSTM version?

markovyao updated 4 years ago
23
xiaobaishu0097/ECCV-VN #1

Does the training process need depth data?

The depth data is not included in the data set you shared, but this error message appeared during the training process. Training started from: 2020-09-03 10:23:59 Scene Data Exists! initialized o…

sx-zhang updated 8 months ago
6
Kismuz/btgym #97

Bug: noisy-net layer

hi @Kismuz I was reading the paper "Noisy Network for exploration". And have a question w.r.t its usage in btgym. The paper says that "As A3C is an on-policy algorithm the gradients are unbiased w…

mysl updated 5 years ago
8
Itsukara/async_deep_reinforce #3

Experiment Results

This thread is used for sharing experiment results. I'd appreciate if you could write your experiment result to this thread when you try my code. The following messages are sample reports.

Itsukara updated 8 years ago
6
Kismuz/btgym #124

Overestimated Value Function in Actor Critic Framework

@Kismuz, I believe I have encountered a framework (A3C) limitation. While training a few of my recent models I noticed a strange behavior. For the first part of training everything seems to work fi…

JaCoderX updated 4 years ago
7
quantylab/rltrader #86

main.py does not run.

I ran it with the command below. python main.py --stock_code 005930 005380 015760 --rl_method a3c --net lstm --num_steps 5 --learning --num_epoches 1000 --lr 0.001 --start_epsilon 1 --discount_factor 0.9 --output_na…

hola-ai updated 3 years ago
1
apache/mxnet #18280

[Performance Regression] GPU memory increase for training an…

## Description - There is an MXNet nightly benchmark which runs CV and NLP models on MXNet Nightly pip wheel and report the metrics and it showed a performance regression on GPU Memory. - After bise…

karan6181 updated 4 years ago
11
miyosuda/async_deep_reinforce #1

Problem while using the code

Hello @miyosuda, Thanks for sharing the code, please ignore the title, I tried out your code with the control problem of cartpole balance experiment instead of Atari game, it works well. But few ques…

originholic updated 7 years ago
78
devsisters/DQN-tensorflow #61

Any one who can share model details?

class M1(DQNConfig): backend = 'tf' env_type = 'detail' action_repeat = 1 class M2(DQNConfig): backend = 'tf' env_type = 'detail' action_repeat = 4 I use python m…

Richardxxxxxxx updated 6 years ago
3
