-
Did this code reproduce the results reported in the [paper](https://openreview.net/forum?id=OJiM1R3jAtZ)? Did you rest on other environments/data?
-
# 创建激活环境
创建Conda 环境,这里取Python3.7,因为这是TensorFlow 1.X 的最后支持版本,之后的Python只能用TensorFlow 2.0之后的版本了。2.0 大改,很多老代码用不了。
```bash
conda create -n offline python=3.7
```
conda 重新初始化一下。
```bash
conda…
-
when I try to install this, I encounter "ERROR: Command errored out with exit status 128: git clone -q git://github.com/deepmind/dm_control /tmp/pip-install-z7djkhsc/dm-control_750010d3edcd47c991d4346…
-
First, thank you for sharing the repo!
The dataset seems to consists of state-action pairs, is there a way to recover entire rollout of a policy?
-
**Describe the bug**
The docstring for `qlearning_dataset()` says:
```
terminate_on_end (bool): Set done=True on the last timestep
in a trajectory.
```
However, if you look a…
-
Hello,
I am wondering learning results of `Ant-v2`'s dataset, like ant-random-v0/2 ant-medium-v0/v2. I think it is not listed in d4RL original paper but it's supplemented in github later.
Do y…
-
I'm having a hard time figuring out how qlearning dataset is being built.
As mentioned by @odelalleau in https://github.com/Farama-Foundation/D4RL/issues/182, the `"terminals"` key in some env is ne…
-
When I download a maze2d dataset with `env.get_dataset()`, the downloaded hdf5 file contains "timeouts" key:
```
>>> import d4rl, gym
>>> env = gym.make('maze2d-large-v1')
>>> dataset = env.get_da…
-
Hi, I'm trying to generate the ant maze dataset using the generation script but getting a "No module named 'locomotion.ant'" error when loading the policy `load_policy('ant_hierarch_pol.pkl')`. I inst…
-
Hello, @aviralkumar2907 . Thanks for sharing the CQL code.
For the MuJoCo script, I've noticed that `seed` value is not actually used anywhere in the script.
https://github.com/aviralkumar2907/CQ…