-
Hi Machina,
Great job implementing the QT-Opt algorithm. This repo is the only full implementation of QT-Opt I have found. Do you have validation results for the current implementation?
Thanks,
…
-
Hello,
In sac.py,
```
if t > start_steps:
    a = get_action(o)
else:
    a = env.action_space.sample()
```
You use a random policy before `start_steps`. But y…
-
@Hyeokreal which TensorFlow and Keras versions do you use?
I tried running a3c_continuous.py, but I get these errors:
```
2018-02-16 11:55:39.292348: I tensorflow/core/platform/cpu_feature_guard.…
-
Episode reward summaries are all concentrated together on a few steps, with jumps in between.
Zoomed out:
![image](https://user-images.githubusercontent.com/1750835/50369978-20aace00-0553-11e9-91a…
-
I just used the following code. It ran fine when `import tflearn` was commented out. With both `import tflearn` and `env.render()` present, an error occurred.
```python
import tflearn
import tensor…
-
Are there any plans to extend this to approximated dynamics (e.g. with a NN) and using importance sampling instead of sampling trajectories directly from the environment?
(replace `__init__` env arg…
-
I am attempting to access the Atari environments, and upon importing the latest versions of ale-py, autorom, gym, and even gymnasium, I get the following error when attempting to make an environment of a…
-
If we change the differential equation solving inside the `diffeq_layer` from the current `EnsembleProblem` to a simple for loop, whose forward pass gives the exact same result, the whole training br…
-
Hello,
I was trying to find a way to make the ARS implementation I was working on [in this PR](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/42) faster. My first thought …
-
**Is your feature request related to a problem? Please describe.**
This is a great demo to show the possibility of setting a realtime profile for ros2 nodes and getting better realtime performance.
But curre…