a3c Search Results - Githubissues

1000+ results
for a3c

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #106711

'CUDA out of memory' when using a GPU services for reinforce…

### 🐛 Describe the bug I followed this [tutorial](https://pytorch.org/tutorials/intermediate/rpc_tutorial.html) to implement reinforcement learning with RPC on Torch. And I can run the original tuto…

lpf6 updated 1 year ago
4
miyosuda/async_deep_reinforce #1

Problem while using the code

Hello @miyosuda, Thanks for sharing the code, please ignore the title, I tried out your code with the control problem of cartpole balance experiment instead of Atari game, it works well. But few ques…

originholic updated 7 years ago
78
apache/mxnet #17331

[mxnet 2.0] [item 2.4] Turning on large tensor support by de…

## Description Currently, MXNet only supports tensor size smaller than 2^31. To support large tensors, users need to recompile MXNet with USE_INT64_TENSOR_SIZE compiler flag set to ON. Large tens…

apeforest updated 4 years ago
25
muupan/async-rl #3

Continous control

muupan updated 7 years ago
6
Alfredvc/paac #2

add LSTM layer

Hello, May I ask a naive question, did you try to implement LSTM on this architecture? Or you already did it and find it is not efficient (maybe time consuming?) as people think. In any case th…

chihchiehchen updated 7 years ago
9
quantylab/rltrader #86

main.py가 실행이 안됩니다.

아래의 명령으로 실행했습니다. python main.py --stock_code 005930 005380 015760 --rl_method a3c --net lstm --num_steps 5 --learning --num_epoches 1000 --lr 0.001 --start_epsilon 1 --discount_factor 0.9 --output_na…

hola-ai updated 3 years ago
1
mryellow/gym-mazeexplorer #1

Environment not registered in OpenAI Gym

Hi I would really like to use this environment for Deep RL reserach purposes. But I'm not able to get it to work. Please help. Thanks Using TensorFlow backend. [2017-08-28 17:41:07,956] Making …

arsenious updated 7 years ago
7
ZM-Learn/L2RPN_WCCI_a_Solution #1

About the testing result

Hi，I'm very interested in your project and try to test the agent according to your README. And I encounter a few confusing problems. 1、After I run the `make_submission_file.py` directly using the fol…

ydlu updated 2 years ago
5
openai/spinningup #59

Simple policy gradient: mean over episodes?

In the simple policy gradient implementation [here][simple_pg], all of the observations, actions, and rewards for one "epoch" (potentially consisting of multiple episodes) are gathered into the same l…

neighthan updated 5 years ago
4
ray-project/ray #34778

[RLlib] Error on self-play with Simple_tag

### What happened + What you expected to happen Hi, I am using a self-play scheme on SImple_tag_v2 of Pettingzoo, that works on a previous installation of ray_300_dev0 and al old ray 1.2.0 (with modi…

george-skal updated 2 months ago
5

上一页 1...30 31 32 33 34 35 36...100 下一页

1000+ results for a3c

1000+ results
for a3c