-
Dear Barhate,
Hi! Thank you very much for sharing the code! I can reproduce your results and they look super cool!
While playing around with it,
may I ask about PPO.py, line 83, …
-
Hi,
I have encountered an error after Kaggle submission. The following is the error log from game play on Kaggle. The game only plays for one turn and then stops. I used Python 3.7 to train the model.
``…
-
I tried launching NLE using ray.rllib on Colab:
```
import gym
import nle
from nle.env.tasks import NetHackChallenge
import ray
from ray.rllib.agents.ppo.ppo_torch_policy import PPOTorchPolicy
…
```
-
@mugiwarakaizoku
I'm having trouble getting SAC to learn CartPole effectively. Below is a sample output from one of the better trials, but in most trials it can't even break above a total reward of 1…
-
Is it possible to change the configuration (HFOV of sensors, POSITION of sensors, RADIUS, HEIGHT) of Agent_0 in the PointNav Baseline? When I try to change the base task config, either by altering the…
-
Hi, thanks for the awesome project.
I'm using the following environment:
- stable-baselines3 0.8.0
- imitation 0.2.0
I modified examples/quickstart.py as follows:
- Deleted BC and AIRL
- Modi…
ghost updated 2 years ago
-
**Important Note: We do not do technical support, nor consulting** and don't answer personal questions per email.
Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [R…
-
### What does it mean when we roll out PPO with numsteps > episode length
I know from the code that it will recycle the environment after you pass the terminal timestep. The question that I have is…
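A minimal sketch of what "recycling the environment" looks like in a fixed-length rollout loop. The `ToyEnv` class and all names here are illustrative, not the actual PPO code in question; the point is only that one buffer of `num_steps` transitions can span several episodes, with a reset at each terminal timestep:

```python
# Sketch: collecting a fixed-length rollout (num_steps) across episode
# boundaries. ToyEnv is a stand-in for a real gym environment.

class ToyEnv:
    """Illustrative env: episode terminates after 3 steps, reward 1 per step."""
    def __init__(self):
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # observation

    def step(self, action):
        self.t += 1
        done = self.t >= 3
        return self.t, 1.0, done, {}


def collect_rollout(env, num_steps):
    obs = env.reset()
    transitions = []
    for _ in range(num_steps):
        action = 0  # placeholder policy
        next_obs, reward, done, _ = env.step(action)
        transitions.append((obs, action, reward, done))
        # "Recycle" the environment: the rollout continues past the
        # terminal timestep, so the buffer spans multiple episodes.
        obs = env.reset() if done else next_obs
    return transitions


rollout = collect_rollout(ToyEnv(), 8)
print(sum(done for *_, done in rollout))  # → 2 episode terminations inside one rollout
```

One practical consequence: the episode that gets cut off at the rollout boundary ends without a `done` flag, so PPO-style implementations typically bootstrap the value estimate of the final state there instead of treating it as terminal when computing advantages.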
-
I think it's still using the TF experts right now (incompatible with our repo since the torch port). Addresses part of #215.
-
## Purpose
The purpose of this issue (discussion) is to introduce a series of PRs in the near future targeted at releasing tianshou's full benchmark for the MuJoCo Gym task suite.
This benchmark will inc…