-
Hi,
I am currently encountering some issues while trying to implement RORL; here are the problems:
1. The training time for RORL seems to be quite long (due to the additional calculation of 3 los…
-
I really like the thought process behind this. Nice idea.
-
### Describe the bug
UR10 reach can be trained with RSL_RL but not with RL_GAMES or SKRL.
### Steps to reproduce
Please try to provide a minimal example to reproduce the bug. Error message…
-
### ❓ Question
I want to use soft actor-critic in an environment with continuous state and action spaces. The environment is implemented in a `gym.Env` class with state and action spaces of the type …
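For reference, here is a hedged, minimal sketch of such an environment (the class name and toy dynamics are made up, not from the question, and `gym.spaces.Box` is assumed for both spaces): a `gym.Env` with continuous observation and action spaces, which is the interface SAC implementations typically expect.

```python
# Illustrative only: a minimal continuous-control gym.Env with Box state and
# action spaces. The task (drive a 1-D point toward the origin) is a placeholder.
import gym
import numpy as np
from gym import spaces


class ToyContinuousEnv(gym.Env):
    """1-D point mass that should be driven toward the origin."""

    def __init__(self):
        super().__init__()
        self.observation_space = spaces.Box(-1.0, 1.0, shape=(1,), dtype=np.float32)
        self.action_space = spaces.Box(-1.0, 1.0, shape=(1,), dtype=np.float32)
        self._pos = np.zeros(1, dtype=np.float32)

    def reset(self):
        self._pos = self.observation_space.sample()
        return self._pos.copy()

    def step(self, action):
        # Apply the (clipped) action and reward proximity to the origin.
        self._pos = np.clip(self._pos + 0.1 * action, -1.0, 1.0).astype(np.float32)
        reward = -float(np.abs(self._pos[0]))
        done = bool(np.abs(self._pos[0]) < 0.05)
        return self._pos.copy(), reward, done, {}
```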
-
I want to reproduce the evaluation-only results using the final checkpoints you provided.
I replaced the policy_lm section with the checkpoint path in default.yaml.
The following error occurs when loading…
-
A clear metric should be provided so that every user can determine this status, for greater transparency.
-
Please make sure that this is a bug. As per our
[GitHub Policy](https://github.com/tensorflow/tensorflow/blob/master/ISSUES.md),
we only address code/doc bugs, performance issues, feature requests a…
-
First, thank you very much for this wonderful package.
Visualization of loss trends gives us an indication of where the training process is headed.
Where TD3 is concerned, I think that ther…
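As an illustrative aside, a minimal sketch (the function and variable names are assumed, not part of the package) of plotting per-update critic and actor losses to inspect TD3 loss trends:

```python
# Illustrative helper: plot per-update TD3 losses collected in plain Python lists.
import matplotlib.pyplot as plt


def plot_loss_trends(critic_losses, actor_losses, out_path="td3_losses.png"):
    fig, ax = plt.subplots()
    ax.plot(critic_losses, label="critic loss")
    ax.plot(actor_losses, label="actor loss")
    ax.set_xlabel("update step")
    ax.set_ylabel("loss")
    ax.legend()
    fig.savefig(out_path)
```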
-
Hey all! I'm trying to track down an apparent reproducibility issue I'm having with the PPO implementation after I added some simple WandB logging. I ran the same code 7 times, and 4 of the times the re…
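A common first step when otherwise identical PPO runs diverge is to pin down every source of randomness. Below is an illustrative helper (not taken from the codebase in question) for a PyTorch setup:

```python
# Illustrative seeding helper: fixes the usual sources of nondeterminism.
import os
import random

import numpy as np
import torch


def seed_everything(seed: int = 0) -> None:
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)
    # Deterministic CUDA kernels trade speed for reproducibility.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
```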
-
Hi!
I am using ray 1.2.0 and Python 3.6 on Ubuntu 18.04
I am trying a centralised critic PPO for the waterworld environment from PettingZoo[sisl] (https://www.pettingzoo.ml/sisl/waterworld).
…