-
I tried to run RL with the new gym wrapper code and it gives the following error:
`expected dense_input to have shape (1, 10) but got array with shape (1, 2)`
This is the code: `import asyncio
from…
-
Hi there,
in a2c_common.py line 194:
`self.minibatch_size = self.config.get('minibatch_size', self.num_actors * self.minibatch_size_per_env)`
shouldn't it be
` self.minibatch_s…
-
Hi, I have a question.
In this distributed RL setup, because of OS scheduling, will every process be in nearly the same state and do the same thing?
So all processes will update the network and then sync the grad t…
-
### What
Related to the following
* Main: p9F6qB-ad1-p2
* Painpoints: p58i-a2c-p2
* Simplifying HE Jetpack Reconnections / Exploring Internal Tooling Options - p9F6qB-7G3-p2
* Notes on cur…
-
I got this error after running "FinRL_Ensemble_StockTrading_ICAIF_2020".
**To Reproduce**
Steps to reproduce the behavior:
1. Restart and run all
-------------------------------------
| time/ …
-
Hi,
I want to use your model to fit my observation data, but when I use the logg as input to the fitting code, the fit always fails. So I checked the model's output and discovered that the a1c is …
-
I can run the code on PongNoFrameskip-v4 without problems:
`python main.py --env-name "PongNoFrameskip-v4" --algo ppo`
However, when I run the code on CartPole-v0:
`python main.py --env-name "Cart…
-
I tried the command ``python -m baselines.a2c.run_atari``. When I inserted the line ``print("env's address is : {}".format(id(env)))`` before [subproc_vec_env.py](https://github.com/openai/baseline…
-
I am benchmarking the OpenAI baselines implementation of A2C on Acrobot-v1. There is a phenomenon
I observed, because the original baselines implementation does not normalize either advantage estimate before…
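
For context, advantage normalization usually means standardizing a batch of advantage estimates to zero mean and unit standard deviation before they enter the policy loss. The following is a minimal sketch in plain Python, not the baselines code; the function name `normalize_advantages` and the `eps` default are assumptions made for illustration.

```python
from statistics import fmean, pstdev


def normalize_advantages(advantages, eps=1e-8):
    """Standardize a batch of advantage estimates (hypothetical helper)."""
    mean = fmean(advantages)
    std = pstdev(advantages)  # population standard deviation of the batch
    # eps avoids division by zero when every advantage in the batch is equal
    return [(a - mean) / (std + eps) for a in advantages]


batch = [1.0, 2.0, 3.0, 4.0]
normalized = normalize_advantages(batch)
```

Whether normalization helps on a given environment is an empirical question; the sketch only shows what the operation does to one batch.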
-
Hi,
First of all, thank you for creating such great work.
I used the vel action type and trained with A2C. It can fly in the right direction, but it continues to fly after reaching the en…