dqn-variants Search Results

63 results for dqn-variants

google-deepmind/open_spiel #1158

Quoridor Movement Action IDs keep changing

So, I'm playing Quoridor, and I was trying to figure out which action IDs corresponded to moving the agent. Therefore, after I placed all the walls, I went to look at the legal actions thinking these …

aadharna updated 4 months ago
38
fly51fly/aicoco #5

爱可可老师 (Teacher Aikeke): Weekly Paper Selection

fly51fly updated 4 months ago
106
pytorch/rl #1404

[Feature Request] Action Masking

## Motivation I recently started learning TorchRL, and creating a custom environment (using torchrl.envs.EnvBase) based on the documentation (https://pytorch.org/rl/reference/envs.html). For my envir…

Kang-SungKu updated 1 year ago
16
JuliaReinforcementLearning/ReinforcementLearning.jl #557

Refactor of DQN Algorithms

I've started refactoring the DQN implementations, but I'm fairly new to Julia so I'd appreciate your feedback about whether this is a good idea or not. In essence, it looks to me like there is lots…

harwiltz updated 1 year ago
3
ManifoldRG/NEKO_Archive #1

Dataset Availability Analysis

The datasets provided in the original GATO paper are varied and numerous. We need a preliminary analysis of what data is available, what data has equivalents, and what data is not clearly source ab…

harshsikka updated 11 months ago
5
JuliaReinforcementLearning/ReinforcementLearning.jl #144

Recurrent Models

Does this repo support recurrent models (LSTM, for example)?

lorrp1 updated 1 year ago
9
x35f/unstable_baselines #54

OpenAi gym integration

I'd like to do the following but instead of SB3 I'd like to plug in unstable baselines. Is there a quick start guide or documentation somewhere that could help me get started? ``` import gym fro…

Karlheinzniebuhr updated 1 year ago
7
fly51fly/aicoco #3

爱可可老师 (Teacher Aikeke): 24-Hour Trending Shares

Selected Weibo content

fly51fly updated 4 months ago
1907
MushroomRL/mushroom-rl #111

suspected memory leak

**Describe the bug** I run a simple DQN on the Breakout Atari game and the memory slowly increases; after 20-30 epochs it takes 64GB of memory and keeps increasing after that. I use 1 million for the r…

davidenitti updated 1 year ago
8
opendilab/DI-engine #334

[Error] AttributeError: 'InteractionSerialEvaluator' object …

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [x] system worker bug + [ ] system utils bug + [ ] code design/refactor …

mahuangxu updated 2 years ago
2
