-
I am conducting reinforcement learning for a robot using rsl_rl and isaac lab. While it works fine with simple settings, when I switch to more complex settings (such as Domain Randomization), the foll…
-
https://github.com/leggedrobotics/rsl_rl/blob/master/rsl_rl/modules/actor_critic.py#L121
The action noise standard deviation should use a positive parameterization, e.g.
self.distribution = Normal(mean, torch.ones_…
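To illustrate the idea, here is a minimal sketch in plain Python of why a positive parameterization (softplus here; `exp` is another common choice, and PyTorch provides `torch.nn.functional.softplus`) keeps the standard deviation valid no matter what value the unconstrained parameter takes during training. The names below are hypothetical, not from `actor_critic.py`:

```python
import math

def softplus(x: float) -> float:
    # softplus(x) = log(1 + e^x) is strictly positive for all real x
    return math.log1p(math.exp(x))

# An unconstrained learnable parameter can drift negative during training;
# mapping it through softplus guarantees the resulting std stays > 0,
# so Normal(mean, std) never receives an invalid scale.
raw_params = [-5.0, -0.1, 0.0, 2.0]
stds = [softplus(p) for p in raw_params]
assert all(s > 0 for s in stds)
```

The same trick is why many PPO implementations learn `log_std` and exponentiate it rather than learning the std directly.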
-
Hi, I am new to Tianshou and RL. I created an env and ran PPO from Tianshou on it, but I found that the sampled actions are out of range. So I searched and found map_action, but it seems not used in tr…
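For context, here is a simplified sketch of what such an action mapping typically does: clip the raw network output into [-1, 1] and rescale it onto the env's bounds. This is a hypothetical standalone function, not Tianshou's actual implementation, whose exact behavior depends on its `action_scaling` / `action_bound_method` settings:

```python
def map_action(act: float, low: float, high: float) -> float:
    # Clip the raw policy output into [-1, 1], then linearly rescale
    # onto the environment's action bounds [low, high].
    act = max(-1.0, min(1.0, act))
    return low + (high - low) * (act + 1.0) / 2.0

print(map_action(0.0, 0.0, 10.0))  # midpoint of the range -> 5.0
```

Applying such a mapping only at evaluation time (and not during rollout collection) is a common source of out-of-range actions reaching the env.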
-
@Kismuz,
I believe I have encountered a framework (A3C) limitation.
While training a few of my recent models, I noticed strange behavior. For the first part of training everything seems to work fi…
-
I think you should use `tf.stop_gradient()` in https://github.com/coreylynch/async-rl/blob/master/a3c.py#L164. Otherwise, after some training the policy tends to use one action exclusively. Took me a …
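For illustration, here is the same idea as a toy PyTorch example, where `.detach()` plays the role of `tf.stop_gradient` (this is a hypothetical minimal loss, not the code from that repo): the advantage must be treated as a constant in the policy-gradient term, otherwise gradients flow back through the value estimate and the policy tends to collapse onto one action.

```python
import torch

# Toy discrete policy over 3 actions.
logits = torch.zeros(3, requires_grad=True)
log_probs = torch.log_softmax(logits, dim=-1)

# Value estimate that depends on the policy (stand-in for the critic).
value = (log_probs.exp() * torch.tensor([1.0, 2.0, 3.0])).sum()
advantage = torch.tensor(5.0) - value

# detach() stops gradients from flowing through the advantage,
# mirroring tf.stop_gradient in the TF1 A3C implementation.
policy_loss = -(log_probs[1] * advantage.detach())
policy_loss.backward()
assert logits.grad is not None  # gradient flows only through log_probs
```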
-
Hi team, I am getting the following error while enabling 4-bit quantization and LoRA:
```
File "/root/miniconda3/envs/open/lib/python3.11/site-packages/deepspeed/runtime/engine.py", line 262, in __init__
self._c…
```
-