-
Hi, I recently **upgraded PyTorch from 0.1.12 to 0.2.0** and noticed that it drastically slowed down an A3C implementation I was testing. I investigated what the problem could be and found that the **.…
-
I have implemented A3C with multiprocessing (+ PyTorch) instead of threads; however, `bazel run` seems to break silently, without any visible trace. This is what I do:
```bash
$…
```
-
Hi,
I am trying to follow the Ray [tutorials](https://github.com/ray-project/tutorial). However, I found several bugs with the save/load functionality in Ray 0.4 and decided to move to the master branch…
-
I use the same optimizer, the same model architecture, and the same initialization, but the performance is bad.
-
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: Linux Ubuntu 16.04
- **Ray installed from (source or binary)**: source
- **Ray version**: 0.5.3
- **Python…
-
Thank you for sharing your work. It helps a lot.
It seems the code does not include the training part (please correct me if I'm wrong). Would you kindly include it as well? Thanks so much.
-
From the algorithm pseudocode given in the paper, my understanding is that both the task-encoder update and the policy update run SGD directly on Equation (10); this is a meta-update, which should be understandable as simply one gradient update. However, in the source code I see PPO and A3C along with their losses. Does TESP need to rely on the PPO or A3C policy-update schemes?
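To make the "single gradient update" reading concrete, here is a toy sketch of one SGD step on a scalar objective (pure Python; the quadratic loss and learning rate are placeholders, not the paper's Equation (10)):

```python
# Illustrative only: one meta-update = a single SGD step on an objective.
# A toy quadratic stands in for the joint task-encoder + policy loss.

def grad(w):
    # d/dw of the toy loss (w - 3)^2
    return 2.0 * (w - 3.0)

def meta_update(w, lr=0.1):
    # one gradient step -- the "meta-update" in the question above
    return w - lr * grad(w)

w = 0.0
w = meta_update(w)
print(w)  # one step moves w toward the minimum at 3.0
```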
-
I'm training an A3C trainer with PyTorch and `tune.run()`:
```python
tune.run(a3c.A3CTrainer, local_dir=".", stop={"episode_reward_mean": 0.5},
         resources_per_trial={"cpu": 1, "gpu": 1},
         …
```
-
Hello, I have an error when I run `test.py`:
```
python3 test.py
Traceback (most recent call last):
  File "test.py", line 68, in <module>
    test(opt)
  File "test.py", line 58, in test
    state, rewar…
```