deep-rl Search Results - Githubissues

1000+ results
for deep-rl

Best match


Kismuz/btgym #124

Overestimated Value Function in Actor Critic Framework

@Kismuz, I believe I have encountered a framework (A3C) limitation. While training a few of my recent models I noticed a strange behavior. For the first part of training everything seems to work fi…

JaCoderX updated 4 years ago
7
nebuly-ai/optimate #298

[Chatllama] KL Divergence equation

Hello, I have a quick question. I know most RLHF structures use KL divergence. https://github.com/nebuly-ai/nebullvm/blob/aad1c09ce20946294df3ec83569bad9496f58d0e/apps/accelerate/chatllama/chatllam…

mountinyy updated 1 year ago
3
Luca96/carla-driving-rl-agent #30

Error related to keras

Hi, I am getting this error. I tried the change suggested in this, but I am still not able to run the file. pygame 2.0.1 (SDL 2.0.14, Python 3.8.10) Hello from the pygame community. https://www.p…

SExpert12 updated 1 month ago
8
will-jac/rl-spades #1

Test code

Is there any test code that can run through?

Biao-K updated 2 years ago
1
hill-a/stable-baselines #367

[question] Why are RL CNNs so shallow?

It seems that RL CNNs are much shallower than the ones used on ImageNet. Am I right about this? And why would that be the case?

AlanKuurstra updated 4 years ago
2
kevingo/bookmarks #133

Reinforcement learning

http://twitter.com/kevingo/status/940942203550969856

kevingo updated 6 years ago
1
sherjilozair/dqn #4

dqn.py indexing is not right

I found that the indexing in build_function is not right. You can run the code below to verify the wrong indexing in VS[:, A]. This indexing should be written like lines 51 and 52 in https://github.com/ShibiHe/D…

ShibiHe updated 7 years ago
1
TMats/survey #183

Continuous control with deep reinforcement learning

https://arxiv.org/abs/1509.02971 - Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra - Submitted on 9 Sep 2015 (v1), last r…

TMats updated 6 years ago
1
neurodata/ProgLearn #289

Add Progressive Reinforcement Capabilities (potentially via …

**Is your feature request related to a problem? Please describe.** We would like to devise a Reinforcement approach that leverages progressive learning to improve its in-task predictions in mapping s…

levinwil updated 2 years ago
3
godot-rust/book #8

Candidates for FAQ

It's not yet clear whether we'll have an FAQ section in the book and if yes, how we are going to structure it. One idea would be to have 100% of the information as part of the tutorial, and use the FA…

Bromeon updated 5 days ago
9

