-
### ❓ Question
Hi, I'm excited to use this amazing project.
I'm implementing GAIL-PPO. GAIL has a generator network and a discriminator network, while PPO has an actor network and a critic …
-
## Bug description
While running the following code, this error occurs:
```
Traceback (most recent call last):
  File "main.py", line 73, in <module>
    gail_trainer.train(20000)
  File "/local/home/.…
```
-
## Bug description
I want to use frame stacking (4 consecutive image frames as model input), which works well with plain PPO in SB3.
But after running the above program (about GAIL)…
-
Hi,
Builds running in firewalled networks might fail. Would it be possible to convert to HTTPS URIs?
Thanks.
-
## Bug description
When trying to train GAIL on Humanoid, I always get a variable horizon error. I am using the code provided in your documentation, which is reproduced below.
## Steps to reproduce
```
…
```
-
## DI-engine
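- Project URL: https://github.com/opendilab/DI-engine
- Category: Python, machine learning
- Project title: DI-engine is a general-purpose decision intelligence engine built on PyTorch and JAX.
- Project description:
**DI-engine** uses **python-first** and **asynchronous-nati…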
-
I used the `ForwardIs` function to get my model's forward results, calling it in a loop.
It works well when the model's forward function has three or fewer outputs.
But the goroutine crashed when the …
-
Selected Weibo content
-
Hi, I noticed that the AIRL implementation is not correct. It appears to use the GAIL reward signal here.
https://github.com/Stanford-ILIAD/Confidence-Aware-Imitation-Learning/blob/1d8af0e4…
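
For readers comparing the two reward signals mentioned above, here is a minimal sketch. It assumes the discriminator is parameterized as D = sigmoid(f) for a raw logit f, and that the GAIL reward takes the common form -log(1 - D); under those assumptions the AIRL reward log D - log(1 - D) simplifies to f itself. The function names are hypothetical and are not taken from the linked repository.

```python
import math

def log_sigmoid(x: float) -> float:
    # Numerically stable log(sigmoid(x)).
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))

def gail_reward(logit: float) -> float:
    # Assumed GAIL form: -log(1 - D), where 1 - D = sigmoid(-logit).
    # This is always positive.
    return -log_sigmoid(-logit)

def airl_reward(logit: float) -> float:
    # AIRL form: log D - log(1 - D), which equals the logit
    # and can therefore be negative.
    return log_sigmoid(logit) - log_sigmoid(-logit)

for f in (-2.0, 0.0, 2.0):
    print(f, gail_reward(f), airl_reward(f))
```

The two differ most visibly for negative logits: the assumed GAIL reward stays positive, while the AIRL reward follows the logit below zero.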
-
Hi @toshikwa
I'm puzzled by your annotation in `update_disc()`. You said the output of the discriminator is in (-inf, inf), not [0, 1]. Should the output of disc be (-1, 1) when `hidden_activation` of the …
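
As an illustration of the range question, here is a small sketch. It assumes the discriminator's last layer is a plain linear layer with no activation, which the truncated question does not confirm. Under that assumption, a tanh hidden activation bounds only the hidden units to (-1, 1); the final weighted sum can still take any real value. The function `disc_logit` and the weights below are hypothetical.

```python
import math

def disc_logit(x, w_hidden, b_hidden, w_out, b_out):
    # Hidden layer with tanh: every hidden unit lies in (-1, 1).
    hidden = [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(w_hidden, b_hidden)
    ]
    # Assumed linear output layer (no activation): the result is unbounded,
    # because the output weights can scale the hidden units arbitrarily.
    return sum(w * h for w, h in zip(w_out, hidden)) + b_out

logit = disc_logit(
    x=[1.0, -2.0],
    w_hidden=[[3.0, 0.0], [0.0, 3.0]],
    b_hidden=[0.0, 0.0],
    w_out=[5.0, -5.0],
    b_out=0.0,
)
print(logit)  # lies well outside (-1, 1) for these weights
```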