-
I keep getting this error due to some in-place changes to the variable `a` in `sample_multiple`:
`[W python_anomaly_mode.cpp:60] Warning: Error detected in AddmmBackward. Traceback of forward call that…
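For context, this warning usually means a tensor that autograd saved for the backward pass was mutated in place afterwards. A minimal sketch (hypothetical tensors; the original `sample_multiple` is not shown) that reproduces and then fixes this class of error:

```python
import torch

# Anomaly mode attaches a forward-call traceback to backward errors,
# which is what produces the "[W python_anomaly_mode...]" warning above.
torch.autograd.set_detect_anomaly(True)

lin = torch.nn.Linear(4, 4)
a = torch.randn(2, 4).requires_grad_().relu()  # non-leaf activation
out = lin(a)   # addmm saves `a` to compute the weight gradient
a += 1.0       # in-place change bumps `a`'s version counter

try:
    out.sum().backward()
except RuntimeError as err:
    print("backward failed:", type(err).__name__)

# Fix: make the update out of place so the saved tensor stays intact.
a2 = torch.randn(2, 4).requires_grad_().relu()
out2 = lin(a2)
a2 = a2 + 1.0          # allocates a new tensor; saved activation untouched
out2.sum().backward()  # succeeds
```

Replacing the in-place `+=` with an out-of-place `a = a + 1.0` (or cloning before mutating) is usually enough to clear the error.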
-
- [X] I have marked all applicable categories:
- [ ] exception-raising bug
- [ ] RL algorithm bug
- [ ] documentation request (i.e. "X is missing from the documentation.")
- [X] ne…
-
Hi,
I tried the following command to test the Humanoid-v2 task:
`python train_rpp.py --env_name=Humanoid-v2 --save_dir=./tmp/rpp --rpp_value=False`
However, it didn't work and raised these error mes…
-
Hello,
I’ve been using the officially recommended script as follows:
```bash
# dc
python3 experiment.py --env hopper --dataset medium --model_type dc --K 8 --embed_dim 256 --learning_rate 0.0001…
-
**Is your feature request related to a problem? Please describe.**
The current `pip install d3rlpy` installs a bunch of new packages and upgrades existing packages without the user's consent. This terrib…
-
Overall, we should try to focus our efforts towards what's necessary for the paper.
1. DataDistance vs. ScoreMatching - do we also want to show that optimal control w/ data distance penalty is emp…
-
# Summary
1. There are issues with the score calculation for expert policies in the maze2d environment.
2. The incorrect score calculation is a result of the expert policies not being called …
-
**Is your feature request related to a problem? Please describe.**
Model-based offline RL algorithms that can handle image inputs are necessary for some environments.
**Describe the solut…
-
### Question
Hi, please state clearly in the documentation and dataset definition whether, within a time step, "r_0" is the consequence of "a_0".
With previous offline RL libs, there has been some confusion wi…
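To make the convention being asked about concrete, here is a hedged sketch with a toy environment (hypothetical; not d3rlpy's actual API). Under the gym-style convention, the reward returned by `step(a_t)` is the consequence of `a_t`, so a stored transition pairs `r_0` with `(s_0, a_0)`:

```python
# Toy environment (hypothetical) illustrating the "r_0 is the consequence
# of a_0" convention: env.step(a_t) returns (s_{t+1}, r_t, done, info).
class CounterEnv:
    """State is an integer; the reward equals the action just taken."""
    def reset(self):
        self.s = 0
        return self.s

    def step(self, a):
        self.s += a
        return self.s, float(a), self.s >= 3, {}

def collect(env, policy, horizon=10):
    s = env.reset()
    transitions = []
    for _ in range(horizon):
        a = policy(s)
        s_next, r, done, _ = env.step(a)
        transitions.append((s, a, r, s_next))  # r_t stored with (s_t, a_t)
        if done:
            break
        s = s_next
    return transitions

transitions = collect(CounterEnv(), policy=lambda s: 1)
# Each stored reward is the consequence of the stored action:
assert all(r == float(a) for _, a, r, _ in transitions)
```

The alternative convention (storing `r_t` as the reward received *on arriving* at `s_t`) would shift every reward by one index, which is exactly the ambiguity the documentation should resolve.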
-
What do we want out of our experiments? In the setting of offline RL, we want our algorithm to
1. Achieve reasonable success on the task
2. Show that adding distribution risk improves over vanilla …