gail-ppo Search Results

178 results for gail-ppo

DLR-RM/stable-baselines3 #1394

[Question] Modify actor's loss for GAIL-PPO

### ❓ Question Hi, I'm excited to use this amazing project. I'm implementing GAIL-PPO. GAIL has the generator network and the discriminator network, while PPO has the actor network and the critic …

Liuzy0908 updated 1 year ago
2
HumanCompatibleAI/imitation #668

Torch Cuda Error

## Bug description While running the following code, this error occurs: ``` Traceback (most recent call last): File "main.py", line 73, in gail_trainer.train(20000) File "/local/home/.…

mertalbaba updated 1 year ago
3
HumanCompatibleAI/imitation #692

FrameStack Bug.

## Bug description I want to use the frame stacking technique (4 consecutive frames of images as model input), which works well in PPO-only in SB3. But after running the above program (about GAIL)…

Liuzy0908 updated 1 year ago
1
mila-iqia/milabench #4

use https instead of git uris

Hi, builds running in firewalled networks might fail. Would it be possible to convert to https URIs? Thanks.

tbugfinder updated 1 year ago
6
HumanCompatibleAI/imitation #669

GAIL always raises variable horizon error

## Bug description When trying to train GAIL on Humanoid, always get variable horizon error. I am using the code provided on your documentation, which is written below. ## Steps to reproduce ```…

mertalbaba updated 1 year ago
3
521xueweihan/HelloGitHub #2513

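[Open-source self-recommendation] DI-engine - a general-purpose decision intelligence engine

## DI-engine - Project URL: https://github.com/opendilab/DI-engine - Category: Python, machine learning - Project title: DI-engine is a general-purpose decision intelligence engine based on PyTorch and JAX. - Project description: **DI-engine** takes **python-first** and **asynchronous-nati…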

VaninaY updated 1 year ago
1
sugarme/gotch #75

ForwardIs may crash when the forward function of saved mode…

I used the ForwardIs func to get my model's forward results, and I called it in a loop. It works well when the forward function of my model has only 3 or fewer outputs. But the goroutine crashed when the …

lieral updated 1 year ago
7
fly51fly/aicoco #3

Teacher 爱可可's 24-hour trending shares

Selected Weibo content

fly51fly updated 2 months ago
1907
Stanford-ILIAD/Confidence-Aware-Imitation-Learning #1

The reward signal for AIRL is not correct.

Hi, I notice that the implementation for AIRL is not correct. You happen to use the reward signal for GAIL here. https://github.com/Stanford-ILIAD/Confidence-Aware-Imitation-Learning/blob/1d8af0e4…

Altriaex updated 1 year ago
3
toshikwa/gail-airl-ppo.pytorch #8

About disc's output

Hi @toshikwa I'm puzzled by your annotation in `update_disc()`. You said the output of the discriminator is (-inf, inf), not [0, 1]. Should the output of disc be (-1, 1) when `hidden_activation` of the …

nicholas0717 updated 1 year ago
2
