ppo-pytorch Search Results

vwxyzjn/cleanrl #471

Setup issue

Trying to follow the simple README instructions on an Ubuntu server with 2x 4090 GPUs and CUDA 12.4: ```bash Installing the current project: cleanrl (2.0.0b1) (cleanrl) ➜ cleanrl git:(master) po…

catid updated 1 day ago

pytorch/torchtune #812

[RFC] Proximal Policy Optimisation

# Implementing Proximal Policy Optimisation I've used some of the [PyTorch RFC](https://github.com/pytorch/rfcs/blob/master/README.md) template here for clarity. **Authors:** * @salmanmohammadi…

SalmanMohammadi updated 2 weeks ago

openai/spinningup #384

Pytorch PPO Implementation, dimension difference

Hello, apologies if I do this wrong I don't contribute to open source often. I was attempting to run the Pytorch PPO implementation and kept getting several errors regarding the dimension of the obser…

kevin-mahon updated 1 year ago

uvipen/Super-mario-bros-PPO-pytorch #19

AttributeError: 'Monitor' object has no attribute 'pipe'

While testing the model i get this: Traceback (most recent call last): File "test.py", line 65, in test(opt) File "test.py", line 55, in test state, reward, done, info = env.step(act…

VicenHe updated 3 months ago

pytorch/pytorch #93697

[User model][tracker] Improve compilation of PPO model (Stab…

`pip install stable-baselines3[extra]` ## Repro ```python from stable_baselines3 import PPO import torchdynamo @torchdynamo.optimize("inductor") def train(): model = PPO("MlpPoli…

msaroufim updated 1 month ago

huggingface/trl #1783

Clarification on reward/value heads in PPOV2

First, thank you for your efforts in helping to bring accurate and performant RLHF techniques to the open-source community. I'm raising this issue hoping to get some clarification on a couple implem…

SalmanMohammadi updated 2 weeks ago

ikostrikov/pytorch-a2c-ppo-acktr-gail #191

Env Observation space dimension error whenever I try to trai…

Hello, I followed steps mentioned [here](https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail#requirements) to install requirements for this repository. There is one minor change, I am using vi…

ashwinipokle updated 5 years ago

nikhilbarhate99/PPO-PyTorch #67

(Solved) No env.reset() at the end of each training epoch.

【**Existing code:**】 Only reset the environment at the beginning of training loop, that is, only call env.reset() at the first epoch. 【**Right(might) training paradigm**】 I checked OpenAI spinning-…

slDeng1003 updated 2 months ago

ikostrikov/pytorch-a2c-ppo-acktr-gail #201

error happened when running with ppo

env--reacher algo--ppo error: Traceback (most recent call last): File "/home/al/Desktop/pytorch-a2c-ppo-acktr-gail-master/main.py", line 196, in main() File "/home/al/Desktop/pytorch-…

tangypnuaa updated 4 years ago

HumanCompatibleAI/imitation #781

Got an unexpected keyword argument 'use_sde' when passing be…

## Bug description Hello, I want to pass the policy learned from behavioural cloning in imitation library to PPO, I thought it would be successful since they are both from ActorCriticPolicy class,…

JkAcktuator updated 2 months ago

1000+ results for ppo-pytorch
