a3c Search Results - Githubissues

1000+ results
for a3c

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

rock-learning/bolero #9

Add more policy search algorithms and policy representations

Policy Search - [ ] [PI2](http://proceedings.mlr.press/v9/theodorou10a/theodorou10a.pdf), is already implemented #28 - [ ] [PoWER](http://www.ias.informatik.tu-darmstadt.de/publications/peters_ADPR…

AlexanderFabisch updated 5 years ago
1
MG2033/A2C #7

Dose A2C support experience replay?

I read your code and implement a version with experience replay. However, I find that the loss explode after a few frames(almost 1000). Value loss would be very large and action loss would be very ne…

ShaoyuanLi updated 5 years ago
2
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch #45

FileNotFoundError: [Errno 2] No such file or directory

I run the Cart_Pole.py with A3C&A2C on linux and got the error. Traceback (most recent call last): File "/usr/local/lib/python3.6/multiprocessing/process.py", line 258, in _bootstrap self.r…

shuferhoo updated 3 years ago
2
YuhangSong/DHP #50

On 'v' action rewards

Hello dear mr. Yuhang Song, In the paper, it is mentioned that the rewards for action **v** are given by ![rewv](https://user-images.githubusercontent.com/28454109/75548914-044bd080-5a37-11ea-888…

aletd updated 4 years ago
1
MorvanZhou/pytorch-A3C #22

Process hangs at res_queue.get() in Linux

In discrete_A3C.py, the res_queue.get() in the main function hangs for a very long time (possibly forever) in Linux, but the entire code works perfectly fine on Windows. ``` workers = [Worker(gnet…

rons613 updated 2 years ago
2
arXivTimes/arXivTimes #308

Curiosity-driven Exploration by Self-supervised Prediction

## 一言でいうと強化学習において、特に高次元になると報酬が得られる機会はとても少なくなる。そこで「好奇心」、つまり新規性のある環境への到達について報酬を設定することで学習速度を上げる試み。これによりベースライン(A3C)よりも高い学習性能を記録することができた。Doomとマリオブラザーズのデモ有 ![image](https://cloud.githubusercontent.co…

icoxfog417 updated 7 years ago
1
muupan/async-rl #20

Installation: ImportError: No module named 'ale_python_inter…

When I try to run the saved model as : ``` bash python demo_a3c_ale.py ../roms/breakout.bin trained_model/breakout_ff/80000000_finish.h5 ``` I get an error : ``` bash ImportError: No module named '…

sahiliitm updated 7 years ago
1
29-75/running-car #7

Running car를 개발하기 위한 참고 가능한 자료를 모아보자

gon-park updated 3 years ago
12
miyosuda/async_deep_reinforce #27

use Multiprocessing instead of Threading?

The current implementation use Python Threading. I just wonder if this can be switched using Multiprocessing? Multiprocessing performs much better on multi-core CPUs than Threading in Python. Is…

mw66 updated 7 years ago
3
dennybritz/reinforcement-learning #46

Implementation of Value Iteration Networks paper - NIPS 2016…

Do you have plans to implement Value Iteration Networks paper - NIPS 2016 best paper in tensorflow . It would be great and fantastic

developeralgo8888 updated 7 years ago
3

上一页 1...21 22 23 24 25 26 27...100 下一页

1000+ results for a3c

1000+ results
for a3c