-
I encountered a problem with the implementation of MO_PPO.
There was a dimension mismatch in the `_reward` vector of the `sync_vector_env` environment.
I worked around the issue by extending th…
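For context, a minimal sketch of that kind of workaround, assuming a recent Gymnasium where `SyncVectorEnv` keeps per-env rewards in a `_rewards` buffer of shape `(num_envs,)`; the class name `MOSyncVectorEnv` and the `reward_dim` parameter are illustrative, not taken from the original report:
```python
import numpy as np
from gymnasium.vector import SyncVectorEnv

class MOSyncVectorEnv(SyncVectorEnv):
    """SyncVectorEnv variant whose reward buffer can hold vector rewards."""

    def __init__(self, env_fns, reward_dim: int):
        super().__init__(env_fns)
        # The stock buffer assumes one scalar reward per sub-environment,
        # which is where the dimension mismatch comes from when `step`
        # returns a reward vector. Re-allocate it with an objective axis.
        self._rewards = np.zeros((self.num_envs, reward_dim), dtype=np.float64)
```
Whether this alone is enough depends on how the surrounding vector-env code indexes the buffer, so treat it as a starting point rather than a fix.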
-
### 🐛 Bug
Hi
I switched from PPO to MaskablePPO and since then I have been running into an error. The interesting part is that it does not occur immediately; it shows up only after 100k-300k timesteps. Ve…
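The traceback is cut off above, but one common cause of MaskablePPO failures that surface only after many timesteps is reaching a state whose mask marks every action invalid. A minimal sketch of the masking setup with sb3-contrib's `ActionMasker`; `make_custom_env` and `valid_action_mask` are hypothetical stand-ins for the reporter's code:
```python
import numpy as np
from sb3_contrib import MaskablePPO
from sb3_contrib.common.wrappers import ActionMasker

def mask_fn(env) -> np.ndarray:
    # One boolean per discrete action. If every entry is False, the masked
    # distribution has no support and sampling fails, which can happen only
    # once training wanders into such a state (hence the 100k+ step delay).
    mask = env.unwrapped.valid_action_mask()  # hypothetical helper
    assert mask.any(), "at least one action must stay valid"
    return mask

env = ActionMasker(make_custom_env(), mask_fn)  # make_custom_env is hypothetical
model = MaskablePPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=500_000)
```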
-
## Description
We are launching a `rosbag2_py.Recorder()` instance from inside Python and, related to #1458, we do not want it to affect any surrounding control flow.
Additionally, for now (perhap…
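One way to keep the recorder out of the surrounding control flow is to run it on a daemon thread and contain its exceptions there. A minimal sketch, assuming the `Recorder.record(storage_options, record_options)` entry point and a `cancel()` method; option attribute names such as `RecordOptions.all` vary between rosbag2 releases, so check your distro's bindings:
```python
import threading
import rosbag2_py

def start_recorder(uri: str):
    storage_options = rosbag2_py.StorageOptions(uri=uri, storage_id="sqlite3")
    record_options = rosbag2_py.RecordOptions()
    record_options.all = True  # record every topic (attribute name varies by release)

    recorder = rosbag2_py.Recorder()

    def _run():
        try:
            recorder.record(storage_options, record_options)
        except Exception as exc:
            # Keep recorder failures from propagating into the caller's flow.
            print(f"recorder stopped: {exc}")

    thread = threading.Thread(target=_run, daemon=True)
    thread.start()
    return recorder, thread

recorder, thread = start_recorder("my_bag")
# ... surrounding control flow runs unaffected ...
recorder.cancel()  # stop recording; present in recent rosbag2 releases
thread.join()
```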
-
### What happened + What you expected to happen
I want to train a PPO agent in my custom environment called RankingEnv, but I'm encountering several errors and warnings that result in the agent's t…
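The errors themselves are truncated above, but a first diagnostic pass that catches most custom-environment problems is Gymnasium's environment checker; a minimal sketch, with `RankingEnv`'s constructor arguments assumed:
```python
from gymnasium.utils.env_checker import check_env

# RankingEnv is the reporter's custom environment; its constructor
# arguments are not shown in the excerpt, so none are passed here.
env = RankingEnv()
check_env(env)  # raises if reset/step signatures, spaces, or dtypes are off
```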
-
### ❓ Question
Hello,
I am learning how to implement a custom CNN policy and environment with Stable-Baselines3. I am following the "Custom Feature Extractor" example at this link:
https://s…
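For reference, the linked tutorial boils down to subclassing `BaseFeaturesExtractor` and handing it to the policy via `policy_kwargs`. A condensed sketch along the lines of that docs example; the layer sizes and environment ID are placeholders:
```python
import torch as th
import torch.nn as nn
from gymnasium import spaces
from stable_baselines3 import PPO
from stable_baselines3.common.torch_layers import BaseFeaturesExtractor

class CustomCNN(BaseFeaturesExtractor):
    def __init__(self, observation_space: spaces.Box, features_dim: int = 128):
        super().__init__(observation_space, features_dim)
        n_input_channels = observation_space.shape[0]  # channel-first images
        self.cnn = nn.Sequential(
            nn.Conv2d(n_input_channels, 32, kernel_size=8, stride=4),
            nn.ReLU(),
            nn.Flatten(),
        )
        # Infer the flattened size with a dry forward pass.
        with th.no_grad():
            sample = th.as_tensor(observation_space.sample()[None]).float()
            n_flatten = self.cnn(sample).shape[1]
        self.linear = nn.Sequential(nn.Linear(n_flatten, features_dim), nn.ReLU())

    def forward(self, observations: th.Tensor) -> th.Tensor:
        return self.linear(self.cnn(observations))

policy_kwargs = dict(
    features_extractor_class=CustomCNN,
    features_extractor_kwargs=dict(features_dim=128),
)
model = PPO("CnnPolicy", "BreakoutNoFrameskip-v4", policy_kwargs=policy_kwargs)
```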
-
The source code for the `step` function of `gymnasium.wrappers.time_limit.TimeLimit` is [as follows](https://github.com/openai/gym/blob/dcd185843a62953e27c2d54dc8c2d647d604b635/gym/wrappers/time_limit…
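For readers following along, the logic of that method is short. A paraphrase of the Gymnasium five-tuple version is sketched below; note the linked gym commit is the older four-tuple API that sets `info["TimeLimit.truncated"]` instead:
```python
# Paraphrased TimeLimit.step: count elapsed steps and flip `truncated`
# once the configured limit is reached.
def step(self, action):
    observation, reward, terminated, truncated, info = self.env.step(action)
    self._elapsed_steps += 1
    if self._elapsed_steps >= self._max_episode_steps:
        truncated = True
    return observation, reward, terminated, truncated, info
```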
-
### ❓ Question
The question seems complicated, but it is not.
Given the following min example:
```python
import gymnasium as gym
from gymnasium import spaces
from stable_baselines3.common.env_ut…
-
I ran the example given
```
import os
os.environ["WANDB_DISABLED"] = "true"
!python examples/nlg-reddit/sample-level-dp/fine-tune-dp.py \
--output_dir scratch \
--model_name sshleifer/tiny-gpt2 …
-
### 🐛 Bug
When logging info to TensorBoard, `self.logger.dump(step=self.num_timesteps)` is called after `self.logger.record`
```python
self.logger.record("time/iterations", iteration, exclude="tensor…
-
Amazing work! However, I encountered some problems while using it.
The first problem: if I open more than one environment, i.e. `num_env > 1`, doesn't that make the visualization unreasonable?
The code below…