-
Dear author,
I have read your paper on MuJoCo experiments and I am particularly interested in the hyperparameters used for PPO_GRU and A2C_GRU. I would greatly appreciate it if you could provide me…
-
## Problem
How can I log the training rewards, etc., to TensorBoard for the example GAIL training script?
```python
import numpy as np
import gymnasium as gym
from stable_baselines3 import PPO
…
```
-
Hi, I have a question about the dagger-value algorithm:
when updating the value network, why do you use `torch.max()` to take the larger loss?
What is the meaning of comparing these two losses? In my understa…
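For context, this pattern matches PPO-style value clipping: taking the element-wise maximum of the clipped and unclipped squared errors gives a pessimistic (upper-bound) loss, so the value update cannot profit from moving the estimate far outside the clip range around the old value. A minimal sketch of that idea (the function name and signature are illustrative, not from the repo):

```python
import torch

def clipped_value_loss(values, old_values, returns, clip_eps=0.2):
    # Unclipped squared error between current value estimates and returns.
    loss_unclipped = (values - returns) ** 2
    # Value estimates clipped to stay within clip_eps of the old estimates.
    values_clipped = old_values + torch.clamp(
        values - old_values, -clip_eps, clip_eps
    )
    loss_clipped = (values_clipped - returns) ** 2
    # torch.max keeps the larger (worse) of the two losses element-wise,
    # a pessimistic bound that discourages overly large value updates.
    return torch.max(loss_unclipped, loss_clipped).mean()
```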
-
I am currently training my agent to do two things: 1) collect and deposit food, and 2) take shelter when certain events occur in the environment. I am using GAIL and Behavioral Cloning with PPO to …
-
## Bug description
Example notebook [1_train_bc.ipynb](https://github.com/HumanCompatibleAI/imitation/blob/master/docs/tutorials/1_train_bc.ipynb) gives a `Namespace not found` error for `seals`…
-
## Bug description
Your adversarial model implementations, including GAIL and AIRL, do not work well in MuJoCo environments. I tested on Hopper, HalfCheetah, and Humanoid, and both AIRL and GAIL faile…
-
### 🐛 Bug
I am having a problem running the Atari example code on both the stable and main branches.
It seems the cfg is not correctly passed to gymnasium's `make`.
### To Reproduce
```
python train_pp…
```
-
Some of the tutorials contain hyperparameters that are not well optimized. Also, in some cases we say "increase this value to `x` to get actually good results". We should verify that those claims …
-
*Before proposing this issue, I searched for it in the documentation, the existing issues, and search engines.*
My goal is to reproduce the excellent performance of `GAIL` over `BC` in the setting of `Cartpole`, where the…
-
## Problem
Hi, I'm excited to use this amazing project.
I have an idea about GAIL-PPO. GAIL has the generator network and the discriminator network, while PPO has the actor network and the critic …