td3bc Search Results - Githubissues

tianheyu927/mopo #5

Questions about the number in the paper

Hi, I really appreciate your open-source code. My question is how the performance numbers in the paper are reported. For example, in Table 1, do you use the max evaluation return during the learni…

MSRA-COLT updated 2 years ago

pickxiguapi/Clean-Offline-RLHF #3

Missing code and data for certain feedback types

Dear authors, thank you for the outstanding work on this project! In your paper, you mention that the system was evaluated using comparative, attribute, and keypoint feedback. However, it seems …

thomas475 updated 4 months ago

YangRui2015/RORL #3

Some problems about the implementation of RORL

Hi, I am currently encountering some issues while trying to implement RORL. Here are the problems: 1. The training time for RORL seems to be quite long (due to the additional calculation of 3 los…

awecefil updated 8 months ago

tinkoff-ai/CORL #25

The results about td3_bc on Antmaze

Hi, may I ask about the settings for td3_bc on AntMaze? I find that the current hyperparameters do not work well and cannot obtain results similar to those in the paper. Best

lucasliunju updated 1 year ago

fuyw/jrlzoo #10

About the performance of td3bc on antmaze

Hi, I tried to run the td3bc code on antmaze-medium-play-v0 and found that the performance (0) is lower than the result in the IQL paper (10.6). May I ask about the experimental settings for that? Thank …

sglucas updated 1 year ago

521xueweihan/HelloGitHub #2513

[Open-Source Self-Recommendation] DI-engine - A General-Purpose Decision Intelligence Engine

## DI-engine - Project URL: https://github.com/opendilab/DI-engine - Category: Python, machine learning - Project title: DI-engine is a general-purpose decision intelligence engine based on PyTorch and JAX. - Project description: With **python-first** and **asynchronous-nati…

VaninaY updated 1 year ago

Farama-Foundation/D4RL #192

[Bug Report] gym 0.24.1 get error to run adroit environment

With gym version 0.24.1, I get an error when running the environment "pen-human-v0", raised by the call to `env.reset()` ``` Traceback (most recent call last): File "train_offline_td3bc.py", line…

egg-west updated 1 year ago

microsoft/ATAC #2

Question about D4RL MuJoCo benchmark

Thanks for sharing the code. I have one question. It seems like you are using D4RL v2 (C.2.), and in Table 1 you mention that "the baseline results are from the respective papers". However, some pr…

HYDesmondLiu updated 2 years ago

takuseno/d3rlpy #129

Saving and loading TD3PlusBC models

**Describe the bug** Saving and loading TD3PlusBC models does not appear to work. **To Reproduce** The following procedure works for CQL but not for TD3PlusBC. ``` import d3rlpy dataset,…

osogami updated 3 years ago

9 results for td3bc
