rlkit Search Results - Githubissues

196 results
for rlkit

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

avisingh599/cog #1

No module named 'rlkit.data_management.load_buffer'

In the rlkit there is no module called rlkit.data_management.load_buffer

SHITIANYU-hue updated 3 years ago
2
ARISE-Initiative/robosuite-benchmark #4

Bug with UR5e Lift OSC Pose Experiment

I want to try SAC with UR5e Lift OSC Pose environment, so I modify the variant of Panda like this ``` { "algorithm": "SAC", "algorithm_kwargs": { "batch_size": 128, "eval_max_path_…

chongyi-zheng updated 3 years ago
2
aviralkumar2907/CQL #10

code bugs

https://github.com/aviralkumar2907/CQL/blob/d67dbe9cf5d2b96e3b462b6146f249b3d6569796/d4rl/rlkit/torch/sac/cql.py#L241 `q1_next_actions = self._get_tensor_values(obs, new_curr_actions_tensor, network=…

rainbow979 updated 3 years ago
2
aviralkumar2907/CQL #2

Function argument problem about expl_path_collector.collect_…

In code "cql_mujoco_new.py", define expl_path_collector [https://github.com/aviralkumar2907/CQL/blob/master/d4rl/examples/cql_mujoco_new.py#L65](url) [https://github.com/aviralkumar2907/CQL/blob…

SongyiGao updated 3 years ago
1
Farama-Foundation/D4RL #86

Error in hopper replay datasets

I believe I discovered a potential error for hopper in D4RL (I've fetched the latest version of the dataset). I printed out the number of terminals and timeouts, and calculated per trajectory reward a…

zhihanyang2022 updated 3 years ago
10
Mattief/VIREL #1

Which version of rlkit do you use?

Hi. I see you are the coauthor of the VIREL paper and I opened an accompanying issue at https://github.com/AnujMahajanOxf/VIREL/issues/1. I am wondering if you know the exact commit of https://github.…

jeffwillette updated 4 years ago
2
aviralkumar2907/CQL #4

SAC CQL: Potential mismatch between observations and actions…

Greetings. Thank you for your amazing work on Offline RL, as well as for open-sourcing the code. This present issue pertains to the computation for the lower bounding component of the SAC CQL: …

dosssman updated 3 years ago
4
takuseno/d3rlpy #4

[REQUEST] Implement CRR/AWAC

Hello, Thanks for the resource. It would be nice to implement [Critic Regularized Regression (CRR)](https://arxiv.org/abs/2006.15134) or [Advantage Weighted Actor Critic (AWAC)](https://arxiv.org/…

araffin updated 4 years ago
9
KamyarGh/rl_swiss #1

rlkit/launchers/config.py missing

Hi, thanks for releasing your code for reproduction. However, due to the lack of the rlkit/launchers/config.py, I do not know how to appropriately modify it and run the experiments. Would you please c…

Ericonaldo updated 4 years ago
1
Farama-Foundation/D4RL-Evaluations #6

Socket object in the snapshot ?

Hello, Thank you for the work on this code. I try to train a Offline RL agent on flow, but I'm unable to save the model, I get the following error : `Traceback (most recent call last): Fil…

Thibaud-Ardoin updated 4 years ago
1

上一页 1...9 10 11 12 13 14 15...20 下一页

196 results for rlkit

196 results
for rlkit