-
> ```
> # Copyright (c) Microsoft Corporation.
> # SPDX-License-Identifier: Apache-2.0
>
> # DeepSpeed Team
>
>
> ACTOR_ZERO_STAGE="--actor_zero_stage 0"
> CRITIC_ZERO_STAGE="--critic_zero_…
-
(pytorch) ➜ srcs git:(master) ✗ python learn_by_ppo.py
0%| | 0/11871 [00:00
-
How is `low_level_load_path` in train.py and config_ppo.yaml generated?
evaluate.py sets `lower_model` and `upper_model`.
I get the error `Encoder type cnn not supported!`
I tried all 4 `upper_model` checkpoints; after loading, `encoder_type` is `cnn` rather than `pixel`.
Is there a more detailed description of training or evaluation?
-
Dear ronsailer,
I'm very sorry to trouble you. First, thanks for your contribution. I am running the code on Pong and cannot get a good result, so I want to ask whether you have run this experiment.…
-
Code to reproduce:
```python
import trl
from unsloth import FastLanguageModel
import torch
from tqdm import tqdm
from transformers import AutoTokenizer
from datasets import load_dataset
fr…
-
Hi Lucas,
I've been working on my 3D indoor environment. It's still very basic, but it works, and I just made the repository public: https://github.com/maximecb/gym-miniworld
I've tried to adjus…
-
**Describe the issue**:
Hi,
Could you please add ppo_tuner's performance to the comparison of HPO algorithms:
https://nni.readthedocs.io/en/latest/sharings/hpo_comparison.html
Thanks a lot!
…
-
Could those of you not using the double policy take a look: does your code converge? I've heard that applying tanh directly and then sampling from the distribution affects the entropy computation, but I don't know why. May I ask about this?
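For context, the entropy concern comes from the change of variables when squashing a Gaussian sample through tanh: the log-density of the squashed action needs a Jacobian correction, so an entropy computed from the raw Gaussian overestimates the true entropy of the squashed action. A minimal, library-free sketch (function names here are illustrative, not from the repo):

```python
import math
import random

def tanh_gaussian_logprob(x, mu=0.0, sigma=1.0):
    """Log-prob of y = tanh(x) where x ~ N(mu, sigma^2)."""
    # log-prob of the pre-squash Gaussian sample x
    logp = -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2 * sigma ** 2)
    # change-of-variables correction: log p(y) = log p(x) - log(1 - tanh(x)^2)
    correction = math.log(1.0 - math.tanh(x) ** 2 + 1e-9)
    return logp - correction

# Monte-Carlo entropy estimates: naive (ignores the Jacobian) vs corrected.
random.seed(0)
xs = [random.gauss(0.0, 1.0) for _ in range(10000)]
naive = -sum(-0.5 * math.log(2 * math.pi) - x * x / 2 for x in xs) / len(xs)
corrected = -sum(tanh_gaussian_logprob(x) for x in xs) / len(xs)
print(naive > corrected)  # the squashed distribution has lower entropy
```

In PyTorch, `TransformedDistribution(Normal(...), TanhTransform())` applies the same correction automatically in `log_prob`, which is the usual way to avoid this pitfall.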
-
Running into the issue of `libmem_filesys.so: cannot open shared object file`. I tried googling but could not find any info on this file
Additionally, any chance the Preview 3 and prior version…
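One way to check whether the library exists but simply isn't on the dynamic loader's search path is to try loading it by absolute path; the path below is a placeholder, not the real install location:

```python
import ctypes

# Placeholder path: substitute wherever the package actually installs its libs.
lib_path = "/opt/engine/lib/libmem_filesys.so"
try:
    ctypes.CDLL(lib_path)  # an absolute path bypasses the LD_LIBRARY_PATH search
    status = "loaded"
except OSError as exc:
    status = f"still missing: {exc}"
print(status)
```

If loading by absolute path succeeds, exporting the containing directory in `LD_LIBRARY_PATH` before launching the process should resolve the original `cannot open shared object file` error.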
-
### What happened + What you expected to happen
I want to use an environment with an observation space of 2 dimensions with the new API stack but I'm unable to do so as the `_get_encoder_config` me…
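Until `_get_encoder_config` handles 2-D observation spaces, a common workaround is to reshape the observation yourself before it reaches the encoder: flatten to 1-D so an MLP encoder applies, or add a channel axis so a CNN encoder's 3-D expectation is met. A NumPy sketch with illustrative shapes:

```python
import numpy as np

# Illustrative 2-D observation, e.g. an 8x8 grid with no channel axis.
obs = np.zeros((8, 8), dtype=np.float32)

# Option 1: flatten to 1-D so the MLP encoder path is selected.
flat = obs.reshape(-1)   # shape (64,)

# Option 2: add a trailing channel axis so the CNN path sees a 3-D input.
hwc = obs[..., None]     # shape (8, 8, 1)
print(flat.shape, hwc.shape)
```

In practice either transform would live in a `gymnasium.ObservationWrapper` (with `observation_space` adjusted to match), so the environment itself presents a shape the encoder selection already supports.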