rl-algorithms Search Results

1000+ results for rl-algorithms

fly51fly/aicoco #3

Teacher Aikeke's 24-hour trending shares

Selected Weibo content

fly51fly updated 1 month ago
1906
pytorch/pytorch #90141

Compiled model cannot forward for pytorch 2.0

🐛 Describe the bug: Hello, I downloaded PyTorch 2.0 and played with the toy example: `import torch; import torchvision.models as models; import faulthandler; faulthandler.enable(); model …`

sweetice updated 11 months ago
3
Pi-Star-Lab/RESCO #12

Indiscriminate Yellow Steps

I am walking through the implementation, and it seems that `MultiSignal.step` enforces a peculiarity: if the actor resolution is less than the yellow time of the signals, then all actors are unable t…

mschrader15 updated 10 months ago
5
eclipse-sumo/sumo #14180

Any implementation of car following with DDPG?

Hi all, nice work! I would like to ask whether your sumo-rl library can implement multi-agent vehicle control. For example, I am trying to build a scene: a 1 km straight single lane for car following. There …

laoyouf updated 11 months ago
9
ScheiklP/sofa_zoo #5

Multi-agent environment

Hi @ScheiklP, sorry to disturb you again. Some time ago the question of "successful_task" was resolved. I drew the following line chart with wandb. I set "number_of_envs" to 8 and the results were …

wjyustl updated 8 months ago
5
KTH-FlowAI/DeepReinforcementLearning_RayleighBenard2D_Control #10

On Hyperparameters

Hi all, I have a couple of questions regarding the chosen hyperparameters (e.g., network architecture, PPO hyperparameters, etc.). How did you decide on these specific values? (did you run a hyperpa…

VikasChidananda updated 1 year ago
4
metadriverse/trafficgen #24

Fail to run 'run_rl_training.py'

Thank you so much for your work. When I run `python run_rl_training.py --exp-name rl_test1 --num-gpus 3 --dataset_train /workspace/datasets/generated_1385_training/1385_training/ --dataset_test / work…

XiaomuWang updated 1 year ago
2
UM-ARM-Lab/pytorch_kinematics #8

How to compute the gradient of reward w.r.t. model parameter…

My research interest is the robustness of RL algorithms to environment parameters. I want to modify current RL algorithms so that they achieve good performance when they are tested in environments with u…

c4cld updated 1 year ago
2
Farama-Foundation/ViZDoom #441

Reward implementation for DeadlyCorridor

Hi guys! I'm using ViZDoom for my bachelor thesis experiments and trained the agents in DeadlyCorridor. I implemented a probability calculator for choosing the action "Attack" which is ridiculousl…

juice1000 updated 11 months ago
3
lucidrains/PaLM-rlhf-pytorch #41

Confusion about KL divergence calculation for human feedback…

Hi, thanks for the great work. I also have a question about the KL divergence loss. In papers like [Learning to summarize from human feedback](https://arxiv.org/pdf/2009.01325.pdf), the KL term for huma…

dwyzzy updated 12 months ago
13
