actor-critic-algorithm Search Results

752 results
for actor-critic-algorithm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ray-project/ray #40321

[RLlib] Check reparameterization trick for squashed gaussian…

### What happened + What you expected to happen For context see: https://discuss.ray.io/t/malformed-reparameterization-trick-in-squashed-gaussian/9651/3 Raised here as an issue at the behest of @A…

gresavage updated 8 months ago
1
openai/safety-starter-agents #4

sac-lagrangian shows poor performance on PointGoal1?

On running the lagrangian version of SAC I get the following curve for costs. I tried changing the constraint limits to a range of values and didn't get much benefit: ![lagrangian_sac_pointgoal1](h…

hari-sikchi updated 5 months ago
16
DigiRL-agent/digirl #17

Keyerror in loading final ckpt

Hi, I try to reproduce scores in your paper with your final checkpoint then get the following error. [2024-09-09 19:41:34,814][accelerate.checkpointing][INFO] - All model weights loaded successfull…

yipclam updated 1 week ago
3
openai/baselines #168

Is MPI being used in baselines DDPG?

I don't see any reference to mpiexec when searching in the repo. It it intended that we run with mpiexec to get a parallel version of DDPG? eg I've tried this: `mpiexec -n 4 python -m baselines.d…

hagrid67 updated 6 years ago
7
howardyclo/papernotes #20

Synthesizing Programs for Images using Reinforced Adversaria…

### Metadata Authors: Yaroslav Ganin, Tejas Kulkarni, Igor Babuschkin, S. M. Ali Eslami, Oriol Vinyals Organization: DeepMind Release Date: Arxiv 2018 Paper: https://arxiv.org/pdf/1804.01118.pdf …

howardyclo updated 6 years ago
1
kundtx/lfd2022-comments #29

Learning from Data (Fall 2022)

http://8.129.175.102/lfd2022fall-poster-session/19.html

kundtx updated 1 year ago
8
ray-project/ray #38561

[RLlib] PPO instantiation requires torch even though tf is t…

### Issue Severity Minor: Workaround available, torch must be installed additionally. ### What happened + What you expected to happen PPO Trainer instantiation via RLModule API fails if I wan…

PhilippWillms updated 5 months ago
2
HorizonRobotics/alf #1096

Multi-GPU Training with DDP

This is a follow-up to #913 # Motivation Add full support for multi-process and multi-GPU training in alf with pytorch's [DDP](https://pytorch.org/docs/stable/notes/ddp.html). # Goals - […

breakds updated 2 years ago
14
DLR-RM/stable-baselines3 #1984

[Feature Request] Temporal Convolutional network

### 🚀 Feature Hello guys, After watching this video : [https://www.youtube.com/watch?v=WoLlZLdoEQk](url) I had the idea to extend the NatureCNN to NatureCTN1D this way : ``` class Chomp1d(nn…

tty666 updated 1 month ago
1
ray-project/ray #42064

[RLlib] When setting `config.environment(normalize_actions=F…

### What happened + What you expected to happen when the CQL algorithm is configured with `config.environment(normalize_actions=False,)`, and the `policy.dist_class` is `TorchDiagGaussian`, it resu…

FuBaoLoong updated 7 months ago
1

上一页 1...13 14 15 16 17 18 19...76 下一页

752 results for actor-critic-algorithm

752 results
for actor-critic-algorithm