prioritized-experience-replay Search Results

245 results
for prioritized-experience-replay

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

KNakane/tensorflow #5

Prioritize Experience Replay作成

KNakane updated 5 years ago
2
kairproject/schedule #13

[120분] DDPG 논문 및 코드 리뷰

whikwon updated 5 years ago
10
chainer/chainerrl #278

Replicate Prioritized Experience Replay's reported performan…

Missing details - "all weights w_i were scaled so that max_i w_i = 1". Is max_i w_i computed over a minibatch or the whole buffer? - What is the value of epsilon that is added to absolute TD errors?

muupan updated 5 years ago
9
weidler/RLaSpa #7

Create DQN framework

weidler updated 5 years ago
2
kairproject/schedule #20

[30분] DDPG from Demonstration 관련논문 소개

1. [Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards](https://arxiv.org/pdf/1707.08817.pdf)

Curt-Park updated 5 years ago
13
hill-a/stable-baselines #77

[Question] DQN vs Open AI Baseline's Rainbow agent

The rainbow agent by default experienced the best base result in sonic for the OpenAI team by a large margin, if you exclude the ridiculously resource intensive parallel PPO training: https://arxiv…

wilkinsmicawber updated 5 years ago
1
patrickvonplaten/TRexGameRL #27

Build the Dueling DQN (aka DDQN)

patrickvonplaten updated 5 years ago
1
tensorforce/tensorforce #512

Problem with Memory

Hi, I'm a newbie to Deep RL and tensorforce and I'm trying to understand all the aspects of the algorithms. I'm using the PPO agent right now but I have some doubts regarding the Update Method a…

SestoAle updated 5 years ago
14
MillionIntegrals/vel #1

Implement policy gradient reinforcement learning algorithms

My next step is to have clean working and benchmarked policy gradient reinforcement learning algorithms.

MillionIntegrals updated 5 years ago
7
BeTomorrow/ReImproveJS #3

Is the library in a usable/robust state?

If I wanted to train a feedforward network agent with ~100 inputs, 8 outputs, and a hidden layer of 512 or so, can I use a DQN from this library to do it and expect it to work out okay? Does the DQ…

mturnshek updated 6 years ago
2

上一页 1...18 19 20 21 22 23 24...25 下一页

245 results for prioritized-experience-replay

245 results
for prioritized-experience-replay