ppo Search Results - Githubissues

1000+ results
for ppo

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mplp/docassemble-MLHPPOAndProposedOrder #196

Tester feedback: bug (`children[0].desired_parenting_time_ch…

  |   -------|------------------------------------ Question ID | `desired parenting time change for individual child` Variable sought | `children[0].desired_parenting_time_changes` Package v…

MPLP-Docassemble updated 2 days ago
1
mplp/docassemble-MLHPPOAndProposedOrder #195

Tester feedback: something (`children[0].lives_with`)

  |   -------|------------------------------------ Question ID | `who does child love with?` Variable sought | `children[0].lives_with` Package version | `playground` Form | `docassemble.pla…

MPLP-Docassemble updated 2 days ago
1
ray-project/ray #39519

[RLlib] Make `_check_if_diag_gaussian` available in the util…

### Description Right now the function is in `ppo_catalog.py` but will be used by many `RLModule` subclasses. Make the function available in a more central place like `rllib.utils`. ### Use case ``…

simonsays1980 updated 1 year ago
1
ray-project/ray #31783

[RLlib] AlgorithmConfig() defaults not used by build_sac_mod…

### What happened + What you expected to happen - [x] I searched for related issues and did not find anything matching. The closest issue(s) are: https://github.com/ray-project/ray/issues/22747 and…

gresavage updated 1 year ago
2
Stable-Baselines-Team/stable-baselines3-contrib #68

[Feature Request] Better support for action masking for vect…

**Motivation** Stable-baselines3 (SB3) has introduced support for action masking (see [here](https://sb3-contrib.readthedocs.io/en/master/modules/ppo_mask.html)), which is a great feature. However, t…

BolunDai0216 updated 1 year ago
2
eugval/sim2real_dynamics_simulation #1

TypeError: __init__() got an unexpected keyword argument 'di…

Hi! When I run the code, such as python ppo_multiprocess.py #or python lstm_td3_multiprocess.py #or python td3_multiprocess.py ...... I encounted a same issue: **TypeError: __init__() got an …

Diankuang-Wu updated 3 years ago
5
Replicable-MARL/MARLlib #217

Discrete action space switching continuous action space prob…

Hello developers, I am trying to customize aircombat env, the aircombat environment included in MARLlib, but I have encountered some problems in the post-customization training process, specifi…

shengqie updated 9 months ago
1
astooke/rlpyt #115

Normalizing environment wrapper

For Mujoco envs, i's a standard practice to normalize rewards by a running estimate of their standard deviation (e.g. VecNormalize in baselines, NormalizedEnv in rllab). Without it, performance is not…

vzhuang updated 4 years ago
4
andriusbern/NaoRL #2

ArgumentError: wrong type, client_id = None

Hi, thank you for your work. I was going to try it with a Nao robot after the vrep simulation. When I try to train the model in Vrep at NaoBancing Env with ppo, I got a ArgumentError. `return c_St…

zfwang615 updated 4 years ago
1
hiyouga/LLaMA-Factory #5407

PPO训练问题

### Reminder - [X] I have read the README and searched the existing issues. ### System Info dcu，dtk2404，python3.10进行ppo训练 ### Reproduction 1 /home/zkhy/largeModel/PretrainedModels/qwen/Qwen2-1.5…

yang-chenyu104 updated 19 hours ago
1

上一页 1...86 87 88 89 90 91 92...100 下一页

1000+ results for ppo

1000+ results
for ppo