-
Hi, I really appreciate your open source code. My question is how is your performance number reported in the paper.
For example, in Table 1, do you use the max evaluation return during the learni…
-
Dear authors,
thank you for the outstanding work on this project!
In your paper, you mention that the system was evaluated using comparative, attribute, and keypoint feedback. However, it seems …
-
Hi,
I am currently encountering some issues while trying to implement RORL,here is the problems:
1. The training time for RORL seems to be quite long (due to the additional calculation of 3 los…
-
Hi
May I ask the setting about td3_bc on antmaze. I find current hyperparameters can not work well and obtain a similar result as in the paper.
Best
-
Hi
I try to run the code of td3bc on antmaze-medium-play-v0 and I find the performance (is 0) is lower than the results in iql paper (10.6). May I ask the experimental setting about that?
Thank …
-
## DI-engine
- 项目地址:https://github.com/opendilab/DI-engine
- 类别:Python、机器学习
- 项目标题:DI-engine 是一个基于 PyTorch 和 JAX 的通用决策智能引擎。
- 项目描述:
**DI-engine** 以 **python-first** 和 **asynchronous-nati…
-
With the gym version 0.24.1, I got an error to run environment "pen-human-v0", which is from the calling `env.reset()`
```
Traceback (most recent call last):
File "train_offline_td3bc.py", line…
-
Thanks for sharing the codes.
I have one question. It seems like you are using D4RL v2 (C.2.), and in Table 1 you mention that "the baseline results are from the respective papers". However, some pr…
-
**Describe the bug**
Saving and loading TD3PlusBC models do not appear to be working.
**To Reproduce**
The following procedure works for CQL but not for TD3PlusBC.
```
import d3rlpy
dataset,…