-
On FetchPush-v1, the model (trained with TD3) always takes the same action after some timesteps.
Trying SAC + SDE to solve this.
-
This is an excellent repo! Thank you to the authors. I would like to know if there are any plans to add [PD-MORL](https://github.com/tbasaklar/PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Le…
-
https://github.com/LucasPautasso/Ejercicios2021-TD3-LucasPautasso/blob/24d14ad4f8ea3fd8c3ae5fbbeba3c381e5e14aeb/Ej22-MasSemaforos/src/main.c#L68
-
### What happened + What you expected to happen
By default, `normalize_actions` is set to `True` in the Trainer config for `Box` action spaces.
https://github.com/ray-project/ray/blob/c0ec20dc3a3f733fd…
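
For readers unfamiliar with action normalization, here is a minimal sketch of what such a setting typically implies for a bounded `Box` space: the policy outputs actions in `[-1, 1]`, and these are rescaled to the environment's `[low, high]` bounds before being sent to the environment. The helper names `unsquash_action` and `squash_action` are hypothetical and used only for illustration; they are not taken from the RLlib code linked above, and the actual implementation may differ.

```python
import numpy as np


def unsquash_action(action, low, high):
    """Map a normalized action in [-1, 1] to the env's [low, high] range.

    Hypothetical helper for illustration; not RLlib's actual API.
    """
    action = np.clip(action, -1.0, 1.0)
    return low + (action + 1.0) * 0.5 * (high - low)


def squash_action(action, low, high):
    """Inverse mapping: env-range action back to [-1, 1]."""
    return 2.0 * (action - low) / (high - low) - 1.0


# Example bounds for a 2-dimensional Box action space (made-up values).
low = np.array([0.0, -2.0])
high = np.array([10.0, 2.0])

# A normalized policy output and its env-range counterpart.
normalized = np.array([0.0, 1.0])
env_action = unsquash_action(normalized, low, high)
```

If an environment already expects actions in its native range and the policy is assumed to output normalized values, mixing the two conventions is a common source of actions that look clipped or constant.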
-
### Objectives
- Comment on this issue to make yourself known as a GitHub user
> this is just a ping to indicate your first and last name
You have been invited to the [team TD3](https://…
-
- Value based RL
- [ ] DQN
- [ ] Rainbow DQN
- [ ] [CQL](https://sites.google.com/view/cql-offline-rl)
- Value based + Policy based RL
- [x] DDPG
- [ ] [TD3](https://spinni…
-
### 🐛 Bug
Using TD3 as an example, if the `noise_type` is not specified for a custom environment in td3.yml, the following weird behavior happens:
The logic of deciding `n_actions` would be …
-
The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:
numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow=…
-
https://github.com/brunorubiolo/Ejercicios2021-TD3-Rubiolo/blob/e6698220196dfc81cc8f711d58c8069987d26871/Ej26-AccesoConcurrenteContador/src/pulsador.c#L84
-
Hi, thank you for your work, it's amazing! I'm a student who just started DRL. I set up the simulation environment according to the tutorial and used your original program to train (by executing 'pyth…