-
In rlkit, there is no module called `rlkit.data_management.load_buffer`.
-
I want to try SAC with the UR5e Lift OSC Pose environment, so I modified the Panda variant like this:
```
{
  "algorithm": "SAC",
  "algorithm_kwargs": {
    "batch_size": 128,
    "eval_max_path_…
```
-
https://github.com/aviralkumar2907/CQL/blob/d67dbe9cf5d2b96e3b462b6146f249b3d6569796/d4rl/rlkit/torch/sac/cql.py#L241
`q1_next_actions = self._get_tensor_values(obs, new_curr_actions_tensor, network=…
-
`cql_mujoco_new.py` defines `expl_path_collector`:
https://github.com/aviralkumar2907/CQL/blob/master/d4rl/examples/cql_mujoco_new.py#L65
[https://github.com/aviralkumar2907/CQL/blob…
-
I believe I discovered a potential error for hopper in D4RL (I've fetched the latest version of the dataset). I printed out the number of terminals and timeouts, and calculated per trajectory reward a…
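For anyone who wants to repeat this kind of check, here is a minimal sketch of how the counts and per-trajectory rewards could be computed. It assumes the dataset provides per-step `rewards`, `terminals`, and `timeouts` sequences of equal length; the helper name `summarize_episodes` is hypothetical and is not part of D4RL.

```python
def summarize_episodes(rewards, terminals, timeouts):
    """Count terminal/timeout flags and sum rewards per trajectory.

    Assumption: a trajectory ends at any step where either flag is set.
    Works on plain lists or NumPy arrays.
    """
    n_terminals = sum(bool(t) for t in terminals)
    n_timeouts = sum(bool(t) for t in timeouts)

    returns = []          # one summed reward per trajectory
    running = 0.0         # reward accumulated in the current trajectory
    open_episode = False  # True while a trajectory has steps but no end flag

    for r, term, tout in zip(rewards, terminals, timeouts):
        running += float(r)
        open_episode = True
        if term or tout:
            returns.append(running)
            running, open_episode = 0.0, False

    # Keep a trailing trajectory that never hit a terminal or timeout.
    if open_episode:
        returns.append(running)

    return n_terminals, n_timeouts, returns
```

With a dataset dictionary `data`, this would be called as `summarize_episodes(data["rewards"], data["terminals"], data["timeouts"])`. A trailing trajectory with no end flag is reported separately at the end of the list, which makes truncated episodes easy to spot.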
-
Hi. I see you are the coauthor of the VIREL paper and I opened an accompanying issue at https://github.com/AnujMahajanOxf/VIREL/issues/1. I am wondering if you know the exact commit of https://github.…
-
Greetings.
Thank you for your amazing work on Offline RL, as well as for open-sourcing the code.
This issue concerns the computation of the lower-bounding component of SAC CQL:
…
-
Hello,
Thanks for the resource.
It would be nice to implement [Critic Regularized Regression (CRR)](https://arxiv.org/abs/2006.15134) or [Advantage Weighted Actor Critic (AWAC)](https://arxiv.org/…
-
Hi, thanks for releasing your code for reproduction. However, due to the lack of the rlkit/launchers/config.py, I do not know how to appropriately modify it and run the experiments. Would you please c…
-
Hello,
Thank you for the work on this code.
I am trying to train an offline RL agent on flow, but I'm unable to save the model. I get the following error:
`Traceback (most recent call last):
Fil…