-
### 🐛 Describe the bug
I followed this [tutorial](https://pytorch.org/tutorials/intermediate/rpc_tutorial.html) to implement reinforcement learning with RPC on Torch. And I can run the original tuto…
-
Hello @miyosuda,
Thanks for sharing the code, please ignore the title, I tried out your code with the control problem of cartpole balance experiment instead of Atari game, it works well. But few ques…
-
## Description
Currently, MXNet only supports tensor size smaller than 2^31. To support large tensors, users need to recompile MXNet with USE_INT64_TENSOR_SIZE compiler flag set to ON.
Large tens…
-
-
Hello,
May I ask a naive question, did you try to implement LSTM on this architecture? Or you already did it and find it is not efficient (maybe time consuming?) as people think.
In any case th…
-
아래의 명령으로 실행했습니다.
python main.py --stock_code 005930 005380 015760 --rl_method a3c --net lstm --num_steps 5 --learning --num_epoches 1000 --lr 0.001 --start_epsilon 1 --discount_factor 0.9 --output_na…
-
Hi
I would really like to use this environment for Deep RL reserach purposes. But I'm not able to get it to work. Please help. Thanks
Using TensorFlow backend.
[2017-08-28 17:41:07,956] Making …
-
Hi,I'm very interested in your project and try to test the agent according to your README. And I encounter a few confusing problems.
1、After I run the `make_submission_file.py` directly using the fol…
-
In the simple policy gradient implementation [here][simple_pg], all of the observations, actions, and rewards for one "epoch" (potentially consisting of multiple episodes) are gathered into the same l…
-
### What happened + What you expected to happen
Hi, I am using a self-play scheme on SImple_tag_v2 of Pettingzoo, that works on a previous installation of ray_300_dev0 and al old ray 1.2.0 (with modi…