-
When running A2C , the envelopes does not run at a parallel manner as can be seen from the ubuntu system monitor.
This hurts the performance, and makes **A3C** an even choice, even while using GTX 10…
-
Could I ask how long it takes to train Breakout from scratch to get the desire score (859.57 for Breakout-v3)?
Have You tried BreakoutNoFrameskip? This is a version without repetition and randomnes…
-
Seeing the CI outputs below, exceptions raised in A3C training are currently ignored. The tests should fail instead when exceptions are raised in child processes.
https://travis-ci.org/chainer/chai…
-
First off, terrific work on repo and blog post, very detailed and clear.
I was able to solve the BipedalWalkerHardcore-v2, average 300+ for 100eps, with rl with an a3c implentation I made but it t…
-
I noticed that in your player_util.py action_train function:
```
if self.done:
if self.gpu_id >= 0:
with torch.cuda.device(self.gpu_id):
self.cx = Variable(torch.z…
-
Hi, I tried to run your code.
I ran the train-qgen-reinforce.py (with pre-trained model provided).
The initial score is similar to your accuracy provided README.
I have a question. Is the pre-trai…
-
We discuss possible ways to organize the source code and distribute our Python packages.
Terminology
==========
First of all, a distinction must be made between various terms referring to the d…
-
Hi,
If I want to use A3C, do I need to open multiple instances and each instance is connected by a client? Or I can connect multiple clients to one instance?
-
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: MacOS 10.13.3
- **Ray installed from (source or binary)**: source
- **Ray version**: cloned from repo on Ma…
-
Hi,
When I run the a3c from this [repo](https://github.com/apache/incubator-mxnet/tree/master/example/reinforcement-learning/a3c) using the file launcher.py, it can specify the number of workers. W…
ghost updated
6 years ago