-
I wonder if it would be useful to add a solver that wraps [Richardson Extrapolation](https://github.com/JuliaMath/Richardson.jl) around another solver. You already have a limited form of this (Rombe…
-
Hello,
I've tried in vain to find suitable hyperparameters for SAC in order to solve MountainCarContinuous-v0.
Even with hyperparameter tuning (see "add-trpo" branch of [rl baselines zoo](https:…
-
Hi,
Currently, I have used 4 algorithms from stable-baselines for the task of Roboschool HumanoidFlagrunHarder. My evaluation metric is the mean reward of 100 episodes. Basically: PPO2 is perfect, A2…
-
I'm trying to implement a PPO agent to play with LunarLander-v2 with tf_agents library like it was in [this tutorial](https://pylessons.com/LunarLander-v2-PPO/) ([_github repo_](https://github.com/pyt…
-
## Description
## Reproduce
1. In Jupyter Lab, create a new Text Area widget via
```
from ipywidgets import interact, widgets
widgets.Textarea(
rows=20,
value="d…
-
Hi, first of all, thanks for the great repository!
I was trying to run the pendulum example but get the following error, however, it seems like the code continues till testing 5 episodes. I'm not s…
-
面白そうなので読み始めた。強化学習、よく知らないので楽しみ。
-
**Describe the bug**
Before `4.0.0`, after creating a `PostgresContainer` within another container, but with the same external daemon, `postgres.get_connection_url()` would return something
like `…
-
### Environment
* Operating System:
Manjaro Linux x86_64
* Python Version: `$ python --version`
Python 3.8.2
* How did you install Qgrid: (`pip`, `conda`, or `other (please explain)`)
pip
*…
-
Hi, I tried to implement a DDPG based Actor critic framework using MirroredStrategy,. Without MirroredStrategy code runs perfectly fine. The actor network gets created fine but error appears on tf.gra…