rl-unplugged Search Results

68 results
for rl-unplugged

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

google-research/batch_rl #24

Why use a small batch size?

Hi, May I ask why you have used such a small batch size? Since you have mentioned in the paper that a larger batch size would lead to a significant speed up. Why still 32 in the standard implementat…

shengchao-y updated 3 years ago
2
JuliaReinforcementLearning/ReinforcementLearning.jl #138

In DDPG: Add support for vector actions

Currently only scalar actions are supported, how far up on the priority list is it to expand on this?

oysteinsolheim updated 3 years ago
3
google-research/batch_rl #13

Native Resolution Images

Hi, I've been trying to use the data from rl unplugged in its native resolution (210x160). I hoping to replay the rl unplugged actions from the sequential data release for dopamine into the environ…

MatthewChang updated 3 years ago
1
google-deepmind/deepmind-research #157

[RL Unplugged] Distribution of Various Runs

Is the distribution of the five runs for each dataset the same? Or does run 1 refer to data collected by a policy at the beginning stages of training while run 5 for the last stage of training?

arjung128 updated 3 years ago
1
google-deepmind/deepmind-research #156

[RL Unplugged] Gravitar loss starts at ~0.000

I am using the code in atari_dqn.ipynb to train a policy for Gravitar from scratch (on 1 run = 100 shards of data), and this is what my loss log looks like so far: ``` [Learner] Loss = 0.002 | Ste…

arjung128 updated 3 years ago
1
google-research/batch_rl #15

Data generation is very slow

I am using the following command to try and generate data for one run of Freeway: ``` python -um batch_rl.baselines.train \ --base_dir=/tmp/batch_rl_data \ --gin_files='batch_rl/baselines/co…

muzerobot updated 3 years ago
4
google-deepmind/deepmind-research #74

rl_unplugged/rwrl_d4pg.ipynb does not reproduce

The notebook is easy to get running, kudos for that. However the results do not match the repository. When I run it the output of "Training Loop" is: ``` [Learner] Critic Loss = 4.062 | Policy L…

pmineiro updated 4 years ago
6
google-research/batch_rl #9

Can I train with my own dataset?

Thanks for the great work first! I have a bunch of data in **_(state, action, reward, next state)_** format. I try to understand how you guys parse the $store$_action_ckpt file in the code but I fail…

Marchen0 updated 3 years ago
4
pwr-Solaar/Solaar #1615

Logitech G Pro Wireless Headset sidetone, equalizer, and bat…

- Solaar version (`solaar --version` and `git describe --tags`): solaar 1.1.3 - Distribution: Arch Linux - Kernel version (ex. `uname -srmo`): Linux 5.17.9-arch1-1 x86_64 GNU/Linux - Outpu…

tarneaux updated 2 years ago
90
google-research/batch_rl #8

Retraining the online agent

I'd like to retrain the online DQN agent in order to log some additional data during online training. The README says > This data can be generated by running the online agents using batch_rl/basel…

n17s updated 4 years ago
2

上一页 1...1 2 3 4 5 6 7...7 下一页

68 results for rl-unplugged

68 results
for rl-unplugged