-
**Describe the bug**
Hey Kris, love your framework! Working with a custom environment, and your discrete action unit test works perfect locally. Don't spend much time investigating this yet, just cre…
-
Hi, what should I do if I run the Pendulum-v0 environment with this offpolicy ppo code of yours? Can you please write a simple modified code?
-
hello!
Did you successfully complete the control of the inverted pendulum?
```bash
iteration: 0 cost: 46906.8821282
iteration: 1 cost: 39965.4489433
iteration: 2 cost: 44619.0937448
iterat…
-
Hello,
I was trying to find a way to make the ARS implementation I was working on [in this pr](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/42) faster. My first thought …
-
### ❓ Question
I am currently using SB3 version 2.3.2 and gym 0.25 and am having the seed problem. I checked the code on github here and saw that seed should not be passed to monitor.py but I do no…
-
@Hyeokreal which Tensorflow and Keras version do yo use?
I tried running a3c_continuous.py, but I get these errors:
```
2018-02-16 11:55:39.292348: I tensorflow/core/platform/cpu_feature_guard.…
-
`env = gym.make('Pendulum-v0').unwrapped
ppo = PPO()
all_ep_r = []
for ep in range(EP_MAX):
s = env.reset()
buffer_s, buffer_a, buffer_r = [], [], []
ep_r = 0
for t in range(E…
-
Hi, when running the code:
`python -m baselines.run --alg=ppo2 --env=Pendulum-v0 --num_timesteps=0 --load_path=~/models/Pend-v0 --play`
The error is as follows:
`Running trained model
2019-06-22 1…
-
I wonder if lstm+ppo/sac could use in Tianshou? Since there are some problems.
-
Current draft of the notebook is [here](https://github.com/cwbeitel/kubeflow-rl/blob/master/examples/agents_ppo/demo.ipynb)
Per @jlewi's suggestions using jsonnet to template tf-job's instead of ji…