ppo Search Results - Githubissues

1000+ results
for ppo

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ray-project/ray #40205

[RLlib][PPOConfig] ComplexInputNet not automatically selecte…

### What happened + What you expected to happen I have a simple gymnasium observation space which is made of 1 float box and 1 image. Using ImpalaConfig, ComplexInputNet it is clear chosen becaus…

NDR008 updated 11 months ago
2
agi-brain/xuance #34

Can more benchmark results of different agents on more vario…

目前在文档中看到本项目实现了非常丰富的智能体模型算法，以及不同类型Env的适配，但是好像具体的benchmark试验结果汇总比较有限，存在大量的结果缺失，例如[Atari](https://xuance.readthedocs.io/zh/latest/documents/benchmark/atari.html)、MPE、MAgent等均无试验结果展示，仅有的Mujoco试验结果也不是很完整，仅…

nicklhy updated 4 months ago
1
Edward-Sun/easy-to-hard #10

How do I convert the PPO trained model (.pt) into hf format …

How do I convert the PPO trained model (.pt) into hf format? I tried to use this file to convert using. The following command: ```shell python scripts/convert_checkpoint_to_hf.py \ --…

supermancmk updated 2 days ago
1
OptimalScale/LMFlow #862

[Roadmap] LMFlow Roadmap

This document includes the features in LMFlow's roadmap. We welcome any discuss or contribute to the specific features at related Issues/PRs. 🤗 ### Main Features * Data * [x] DPO dataset format…

wheresmyhair updated 1 week ago
2
anyscale/academy #54

IndexError on calling ppo.PPOTrainer(config, env = SELECT_EN…

I follow the [suggestion ](https://docs.ray.io/en/latest/rllib-training.html#specifying-resources), `config["framework"] = "torch"` `config["num_gpus"] = 0.001 # can't work` `con…

Wormh0-le updated 3 years ago
1
rlworkgroup/garage #1020

PyTorch on CPU is slower than TF

See https://github.com/pytorch/pytorch/issues/975 for more info PyTorch TRPO appears 50% slower than TF. Not sure about PPO, but I expect the wall-clock time gap will be the same. To fix this is…

ryanjulian updated 3 years ago
4
huawei-noah/SMARTS #990

ray.memory_monitor.RayOutOfMemoryError

## **BUG REPORT** **High Level Description** Hi! Why does the latest version still have this bug？ **SMARTS version** [0.4.17] **Error logs and screenshots** ![image](https://user-images.gi…

Meta-YZ updated 3 years ago
12
aidudezzz/deepworlds #105

Gym reset method seed argument mismatch - find_and_avoid_v2

Hello, I am currently working with stable-baselines3 (version 1.8.0) and the sb3-contrib PPO Mask algorithm for my custom environment, FindAndAvoidV2RobotSupervisor. While running the code, I encou…

wayne-weiwei updated 4 days ago
2
haosulab/SAPIEN #157

RuntimeError: GPU PhysX can only be enabled once before any …

**System:** - Google Colab, L4 GPU - 1_[quickstart.ipynb](https://colab.research.google.com/github/haosulab/ManiSkill/blob/main/examples/tutorials/1_quickstart.ipynb) of ManiSkill, using sapien …

erwincoumans updated 4 months ago
2
isaac-sim/IsaacGymEnvs #163

How can I use the SAC algorithm (instead of the default PPO)…

https://github.com/NVIDIA-Omniverse/IsaacGymEnvs/blob/main/docs/rl_examples.md The above website says that Ant tasks can be trained using the SAC algorithm, but there is no specific modification of t…

chenci107 updated 10 months ago
1

上一页 1...93 94 95 96 97 98 99...100 下一页

1000+ results for ppo

1000+ results
for ppo