-
### What happened + What you expected to happen
For context see: https://discuss.ray.io/t/malformed-reparameterization-trick-in-squashed-gaussian/9651/3
Raised here as an issue at the behest of @A…
-
On running the lagrangian version of SAC I get the following curve for costs. I tried changing the constraint limits to a range of values and didn't get much benefit:
![lagrangian_sac_pointgoal1](h…
-
Hi, I try to reproduce scores in your paper with your final checkpoint then get the following error.
[2024-09-09 19:41:34,814][accelerate.checkpointing][INFO] - All model weights loaded successfull…
-
I don't see any reference to mpiexec when searching in the repo. It it intended that we run with mpiexec to get a parallel version of DDPG?
eg I've tried this:
`mpiexec -n 4 python -m baselines.d…
-
### Metadata
Authors: Yaroslav Ganin, Tejas Kulkarni, Igor Babuschkin, S. M. Ali Eslami, Oriol Vinyals
Organization: DeepMind
Release Date: Arxiv 2018
Paper: https://arxiv.org/pdf/1804.01118.pdf
…
-
http://8.129.175.102/lfd2022fall-poster-session/19.html
-
### Issue Severity
Minor: Workaround available, torch must be installed additionally.
### What happened + What you expected to happen
PPO Trainer instantiation via RLModule API fails if I wan…
-
This is a follow-up to #913
# Motivation
Add full support for multi-process and multi-GPU training in alf with pytorch's [DDP](https://pytorch.org/docs/stable/notes/ddp.html).
# Goals
- […
-
### 🚀 Feature
Hello guys,
After watching this video :
[https://www.youtube.com/watch?v=WoLlZLdoEQk](url)
I had the idea to extend the NatureCNN to NatureCTN1D this way :
```
class Chomp1d(nn…
-
### What happened + What you expected to happen
when the CQL algorithm is configured with `config.environment(normalize_actions=False,)`, and the `policy.dist_class` is `TorchDiagGaussian`, it resu…