-
Formula for off_policy_method:
total_timesteps = n_epochs * n_epoch_cycles * batch_size
Then, if
n_epochs=1400
n_epoch_cycles=20
batch_size=64
min_buffer_size=10^6
then total_timesteps=140…
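As a quick sanity check of the formula above, here is a minimal sketch that plugs in the quoted values (the variable names mirror the question; the formula itself is the one stated above, not an official API):

```python
# Sketch: total environment steps for an off-policy run,
# assuming total_timesteps = n_epochs * n_epoch_cycles * batch_size
n_epochs = 1400
n_epoch_cycles = 20
batch_size = 64

total_timesteps = n_epochs * n_epoch_cycles * batch_size
print(total_timesteps)  # 1792000
```

Note that `min_buffer_size` does not enter this product; it only controls how many transitions must be in the replay buffer before learning starts.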
-
Hi,
I am trying to use Arena in my research project. I have several general questions:
1) The [baseline tutorial videos ](https://sites.google.com/view/arena-unity/home/tutorials-baselines?authu…
-
### What happened + What you expected to happen
# What happened
I ran `PPO` with `RLModule` and `_enable_learner_api=True` using `framework="tf2"`.
The following error occurred:
```
Failu…
-
Hello,
First, let me thank you for open-sourcing this great framework. However, I am unable to run the training without getting the following error:
```
Traceback (most recent call last):
Fi…
-
### Question
**TL;DR: do you have baselines for performance on the environments using some popular MARL algorithm, say MADDPG or other?**
Hi there, first of all, thanks for maintaining MAMuJoCo. I…
-
Hello,
I am trying to benchmark your code on more tasks from deepmind/* but they are not working. There seems to be a bug in the `prepare_obs` function in `sbx/common/policies.py`. I attach stack tra…
-
First, thanks for the amazing repository! I wanted to load a pretrained model from Huggingface, which typically creates a folder with the `config.json` and the .bin file containing the weights inside.…
-
First, thanks for making this. It's very easy to get started with and has really helped me move things forward on a personal project of mine I've been struggling with for months. This is really awesom…
-
Hello, I am pretty new to MPI.
I am using stable-baselines DDPG for a custom environment. Everything is working fine and I am getting good results as well.
Question:
When I use MPI and run the co…