-
Hi, I was trying to use A2C on the ingolstadt21 map, but when I set `single_agent=False`, I receive this error.
```
Traceback (most recent call last):
File "/home/ubuntu/Videos/test.p…
-
Dear authors,
Thanks for building such ambitious environments for MARL research. In your paper, I found that it takes 10^9 steps to run the simulation for each agent. In order to train agents, how m…
-
I just set up this repository following the README docs. The evaluation scripts for Scene Priors and Non-Adaptive A3C worked, but the one for SAVN did not. Because PyTorch 0.4.1 is out of date, I tried 1.8.1…
-
### Search before asking
- [X] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues.
### Ray Component
RLlib
### What happened + What you expect…
-
Searching for ["ray pytorch"](https://www.google.com/search?q=ray+pytorch&oq=ray+pytorch) leads to [this page](https://docs.ray.io/en/latest/using-ray-with-pytorch.html).
We should update this page…
-
### What is the problem?
The recently added torch implementation of PPO #6826 is over 5x slower when training on Atari (Breakout) and also ends up slowly consuming all of the system RAM (perf/ram_…
-
## 🐛 Bug
Calling xm.spawn() with nproc=8 makes the Colab notebook run indefinitely.
It works with nproc=1.
If I stop the running cell and run !ps -ef | grep python, there are 30+ Python proc…
-
Can Stable-Baselines3 only train "gym" environments? If so, how can a MAgent-style env be converted into a gym-style env that Stable-Baselines3 can train?
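In practice, the PettingZoo/SuperSuit wrapper route is the usual answer for MAgent environments; the sketch below only illustrates the *shape* of a hand-rolled adapter that exposes one chosen agent through a gym-style `reset`/`step` interface while a fixed policy drives the others. All names here (`SingleAgentAdapter`, `other_policy`) are hypothetical, and a real version would subclass `gym.Env` and declare `observation_space`/`action_space`; gym is omitted for brevity.

```python
class SingleAgentAdapter:
    """Expose one agent of a MAgent-style parallel env (dict-keyed
    observations/rewards/dones per agent) as a gym-style env."""

    def __init__(self, parallel_env, agent_id, other_policy):
        self.env = parallel_env          # dict in, dict out
        self.agent_id = agent_id         # the one agent SB3 would control
        self.other_policy = other_policy # fn(agent_id, obs) for the rest

    def reset(self):
        obs = self.env.reset()           # {agent_id: obs, ...}
        self._last_obs = obs
        return obs[self.agent_id]

    def step(self, action):
        # Joint action dict: our action, plus the fixed policy
        # (random, scripted, ...) for every other agent.
        actions = {
            aid: action if aid == self.agent_id
            else self.other_policy(aid, self._last_obs[aid])
            for aid in self._last_obs
        }
        obs, rewards, dones, infos = self.env.step(actions)
        self._last_obs = obs
        return (obs[self.agent_id], rewards[self.agent_id],
                dones[self.agent_id], infos.get(self.agent_id, {}))
```

This controls a single agent; training all agents at once with SB3 instead calls for vectorizing the per-agent streams, which is exactly what SuperSuit's PettingZoo-to-vec-env wrappers do.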
-
## 🐛 Bug
It seems there is a very small memory leak during forward and backward propagation through the network, which can lead to memory overflow after many hours of training.
I stumbled upon this…
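One framework-agnostic way to confirm a slow host-side leak like this is to snapshot Python heap allocations with the stdlib `tracemalloc` module and compare across iterations. (GPU tensor leaks need torch-specific tools such as `torch.cuda.memory_allocated()`; this sketch only covers Python objects, and `leaky_step` is a stand-in for one training iteration.)

```python
import tracemalloc

retained = []  # simulates state accidentally kept across iterations

def leaky_step():
    retained.append([0] * 1000)  # "forgotten" buffer, grows every call

def measure_growth(step_fn, iters=100):
    """Return net bytes allocated (and not freed) across `iters` calls."""
    tracemalloc.start()
    before = tracemalloc.take_snapshot()
    for _ in range(iters):
        step_fn()
    after = tracemalloc.take_snapshot()
    tracemalloc.stop()
    stats = after.compare_to(before, "lineno")
    # Consistently positive growth per iteration hints at a leak.
    return sum(s.size_diff for s in stats)

print("net allocation growth:", measure_growth(leaky_step), "bytes")
```

Running this for a leak-free step function should show near-zero growth; steady linear growth per iteration points at state being retained between steps.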
-
### What is the problem?
I found that multiple PPO runs still have different performance even when we set the same seed.
How can we obtain exactly the same result with the same seed?
Current strate…
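A common first step is to seed every RNG source in one place, as in the sketch below (a minimal, hypothetical helper, not RLlib's own API). NumPy and torch are seeded only if installed; the RLlib-side knobs (e.g. its `seed` config option) and CUDA determinism flags are worth checking separately, since asynchronous sampling and nondeterministic GPU kernels can still make identically seeded runs diverge.

```python
import os
import random

def seed_everything(seed: int) -> None:
    """Seed the Python-level RNG sources; extra libraries best-effort."""
    random.seed(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)
    try:
        import numpy as np  # optional dependency
        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch  # optional dependency
        torch.manual_seed(seed)
    except ImportError:
        pass

seed_everything(0)
a = [random.random() for _ in range(3)]
seed_everything(0)
b = [random.random() for _ in range(3)]
print(a == b)  # re-seeding reproduces the draws → True
```

Even with this in place, exact reproducibility across PPO runs usually also requires single-threaded/synchronous sampling and deterministic GPU ops, each of which trades away speed.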