-
|
-------|------------------------------------
Question ID | `desired parenting time change for individual child`
Variable sought | `children[0].desired_parenting_time_changes`
Package v…
-
|
-------|------------------------------------
Question ID | `who does child love with?`
Variable sought | `children[0].lives_with`
Package version | `playground`
Form | `docassemble.pla…
-
### Description
Right now the function is in `ppo_catalog.py` but will be used by many `RLModule` subclasses. Make the function available in a more central place like `rllib.utils`.
### Use case
``…
-
### What happened + What you expected to happen
- [x] I searched for related issues and did not find anything matching. The closest issue(s) are: https://github.com/ray-project/ray/issues/22747 and…
-
**Motivation**
Stable-baselines3 (SB3) has introduced support for action masking (see [here](https://sb3-contrib.readthedocs.io/en/master/modules/ppo_mask.html)), which is a great feature. However, t…
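
For readers unfamiliar with the idea, here is a minimal sketch of what action masking does at the policy output. It is not SB3's implementation: the function name `masked_softmax` and its list-based interface are assumptions made for illustration. Invalid actions get a logit of negative infinity, so they receive zero probability after the softmax.

```python
import math


def masked_softmax(logits, mask):
    """Hypothetical helper: softmax over logits with invalid actions removed.

    `mask[i]` is True when action i is allowed in the current state.
    """
    if not any(mask):
        raise ValueError("at least one action must be valid")
    # Invalid actions get -inf so that exp(...) evaluates to exactly 0.0.
    masked = [l if m else -math.inf for l, m in zip(logits, mask)]
    peak = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(l - peak) for l in masked]
    total = sum(exps)
    return [e / total for e in exps]


probs = masked_softmax([1.0, 2.0, 3.0], [True, False, True])
```

Sampling from `probs` can never select the masked action, and the remaining probabilities still sum to one.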
-
Hi! When I run the code, such as
python ppo_multiprocess.py
#or
python lstm_td3_multiprocess.py
#or
python td3_multiprocess.py
......
I encountered the same issue:
**TypeError: __init__() got an …
-
Hello developers, I am trying to customize the aircombat environment included in MARLlib, but I have encountered some problems in the post-customization training process, specifi…
-
For MuJoCo envs, it's standard practice to normalize rewards by a running estimate of their standard deviation (e.g. VecNormalize in baselines, NormalizedEnv in rllab). Without it, performance is not…
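
As a rough illustration of the normalization described above, the sketch below keeps a running standard deviation with Welford's online algorithm and divides each reward by it. The class names `RunningStd` and `RewardNormalizer` are hypothetical, and this version tracks raw rewards only; the library wrappers mentioned above may track different statistics or add clipping.

```python
import math


class RunningStd:
    """Running standard deviation via Welford's online algorithm."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # sum of squared deviations from the running mean

    def update(self, x):
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    @property
    def std(self):
        # With fewer than two samples, fall back to 1.0 (an assumption)
        # so that early rewards pass through unscaled.
        if self.count < 2:
            return 1.0
        return math.sqrt(self.m2 / self.count)


class RewardNormalizer:
    """Hypothetical wrapper: scales rewards by the running std (no mean shift)."""

    def __init__(self, eps=1e-8):
        self.stats = RunningStd()
        self.eps = eps

    def __call__(self, reward):
        self.stats.update(reward)
        return reward / (self.stats.std + self.eps)


normalize = RewardNormalizer()
scaled = [normalize(r) for r in [10.0, 20.0, 30.0, 40.0]]
```

Only the scale is adjusted here; the mean is left alone so the sign of each reward is preserved.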
-
Hi, thank you for your work. I was going to try it with a Nao robot after the vrep simulation.
When I try to train the model in Vrep in the NaoBancing Env with PPO, I get an ArgumentError.
`return c_St…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
DCU, dtk2404, python3.10, running PPO training
### Reproduction
1 /home/zkhy/largeModel/PretrainedModels/qwen/Qwen2-1.5…