-
# Asynchronous Methods for Deep Reinforcement Learning #
- Author: Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuo…
-
Hi,
There are two loss terms in the actor agent: the advantage loss and the entropy loss. Can you tell me why you add the entropy loss? I know the entropy weight is decayed from 1 to 0.0001, but I do not know why yo…
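For context, the entropy term in A3C-style actor losses is an exploration bonus: it discourages the policy from collapsing to a near-deterministic distribution too early, and decaying its weight (e.g. from 1 down to 0.0001) gradually hands control back to the advantage term. A minimal NumPy sketch of the two-part loss (illustrative only, not this repo's exact implementation):

```python
import numpy as np

def actor_loss(action_probs, taken_log_probs, advantages, entropy_weight):
    """Illustrative A3C-style actor loss (not the repo's exact code).

    action_probs    : full policy distribution per step, shape (T, n_actions)
    taken_log_probs : log pi(a_t | s_t) for the actions taken, shape (T,)
    advantages      : estimated advantages A(s_t, a_t), shape (T,)
    entropy_weight  : coefficient decayed over training (e.g. 1 -> 1e-4)
    """
    # Policy-gradient ("advantage") term: reinforce actions with positive advantage.
    pg_loss = -np.mean(taken_log_probs * advantages)
    # Entropy of the policy at each step; higher entropy = more exploration.
    entropy = -np.sum(action_probs * np.log(action_probs + 1e-8), axis=1).mean()
    # Subtracting the entropy bonus penalizes a premature collapse to a
    # near-deterministic policy.
    return pg_loss - entropy_weight * entropy
```

With a uniform two-action policy the entropy is log 2, so a large entropy weight dominates early training; as the weight decays toward 1e-4, the advantage term takes over.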
-
When running step 3 with ZeRO stage 3 enabled for both the actor and critic models,
I get the following error (line numbers may be offset due to debug statements I've added):
```
File "/path/DeepSp…
```
-
I just tried the latest code and found that training has slowed down significantly: it used to run at more than 200 steps_per_second, but right now it's about 100 steps_per_second.
2017-09-24 15:08:08,844…
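To pin down a regression like this, it helps to measure throughput the same way on both commits. A small timing helper (hypothetical, not part of the codebase) that reports steps per second:

```python
import time

def steps_per_second(step_fn, n_steps=100):
    """Run `step_fn` n_steps times and return the measured steps/second.

    `step_fn` stands in for one training step; swap in the real step
    function (this helper is illustrative, not from the repo).
    """
    start = time.perf_counter()
    for _ in range(n_steps):
        step_fn()
    elapsed = time.perf_counter() - start
    return n_steps / elapsed
```

Running this on the old and new commits with the same config isolates whether the slowdown is in the step function itself or elsewhere (data loading, logging, etc.).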
-
> Traceback (most recent call last):
> File "clock_gated_rnn.py", line 63, in
> model.compile(loss='binary_crossentropy', optimizer='adam', class_mode="binary")
> File "/usr/local/lib/python2…
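This traceback most likely comes from the legacy `class_mode` argument: newer Keras versions (1.0 and later, if I recall correctly) removed it from `model.compile`, so passing it raises an error. One workaround is to strip legacy kwargs before compiling; `compile_kwargs` below is a hypothetical helper for illustration, not a Keras API:

```python
def compile_kwargs(loss, optimizer, **extra):
    """Drop legacy arguments (like `class_mode`) that newer Keras
    versions no longer accept. Hypothetical helper, for illustration only.
    """
    legacy = {'class_mode'}
    kwargs = {'loss': loss, 'optimizer': optimizer}
    kwargs.update({k: v for k, v in extra.items() if k not in legacy})
    return kwargs

# Usage (assuming a built `model`):
#   model.compile(**compile_kwargs('binary_crossentropy', 'adam',
#                                  class_mode='binary'))
```

The simpler fix, of course, is just deleting `class_mode="binary"` from the call.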
-
-
Opening this issue to start a discussion about whether it would be worth investing in making it easy to run TensorFlow Agents on K8s.
For some inspiration you can look at [TfJob CRD](https://github.com/…
-
1. [Binary Relevance Efficacy for Multilabel Classification](https://link.springer.com/article/10.1007/s13748-012-0030-x) > https://github.com/Gin04gh/datascience/issues/6#issuecomment-419388287
1. […
-
-
Restarting this because pmh closed it.