-
http://twitter.com/kevingo/status/940942203550969856
-
-
Asynchronous parallel training like A3C is supported by ChainerRL, but synchronous parallel training, where multiple actors interact with their own environments in a synchronous manner, is not support…
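The distinction above can be illustrated with a minimal sketch of synchronous stepping (hypothetical `ToyEnv` and `step_all` names for illustration only, not ChainerRL's actual API): every actor's environment advances exactly once per call, in lockstep, so actions can be chosen for the whole batch at once.

```python
import random

class ToyEnv:
    """Hypothetical 1-D random-walk environment, used only for illustration."""
    def __init__(self, seed):
        self.rng = random.Random(seed)
        self.pos = 0

    def reset(self):
        self.pos = 0
        return self.pos

    def step(self, action):
        # action is +1 or -1; the episode ends when |pos| reaches 3
        self.pos += action + self.rng.choice([0, 0, 1, -1])
        done = abs(self.pos) >= 3
        reward = 1.0 if self.pos >= 3 else 0.0
        return self.pos, reward, done

def step_all(envs, actions):
    """Synchronous stepping: every env advances exactly once per call."""
    results = [env.step(a) for env, a in zip(envs, actions)]
    obs, rewards, dones = zip(*results)
    return list(obs), list(rewards), list(dones)

envs = [ToyEnv(seed=i) for i in range(4)]
obs = [env.reset() for env in envs]
for _ in range(10):
    actions = [1] * len(envs)  # a fixed policy keeps the sketch simple
    obs, rewards, dones = step_all(envs, actions)
    # reset finished environments so the batch stays full
    obs = [env.reset() if d else o for env, o, d in zip(envs, obs, dones)]
```

In asynchronous training (A3C proper), each actor would instead run its own loop on its own thread or process, stepping at its own pace.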
-
Are there any particular requirements on how the reward function is set up? A3C uses state values rather than action values, so should the reward be directly related to the state variables? Another puzzling question: why do two runs with the same weights produce very different results? My cumulative reward shows a trend toward convergence, but it still fluctuates a lot!
![total_reward](https://user-images.githubusercontent.com/68805707/884750…
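On the run-to-run variance question: identical initial weights are not enough for identical results unless every randomness source (environment, exploration noise, framework RNGs) is seeded, and in asynchronous A3C even seeding cannot remove the nondeterminism introduced by thread scheduling. A toy sketch (hypothetical `rollout` function, not any library's API) showing that seeding the noise is what makes runs repeatable:

```python
import random

def rollout(weights, seed=None):
    """Toy rollout: cumulative reward of a fixed linear 'policy'.

    Exploration noise comes from `rng`; with no seed, two runs that
    share the same weights still diverge.
    """
    rng = random.Random(seed)
    total, state = 0.0, 1.0
    for _ in range(100):
        action = weights * state + rng.gauss(0.0, 0.1)  # noisy action
        total += -abs(action - 0.5)                      # reward peaks at 0.5
        state = 0.9 * state
    return total

w = 0.7
# Unseeded: identical weights, different results.
a, b = rollout(w), rollout(w)
# Seeded: identical weights AND identical noise -> identical results.
c, d = rollout(w, seed=0), rollout(w, seed=0)
```

Large fluctuation around a converging trend is also normal for A3C; smoothing the curve over many episodes, or averaging several seeded runs, gives a more readable picture.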
-
Hello!
I wrote my own A3C with LSTM, but it was not perfect. When I trained the model with minibatches it didn't train, but when I used whole-episode experiences it worked well (only fed the LSTM state o…
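A common cause of exactly this symptom is that, when an episode is split into minibatches, the recurrent state gets re-zeroed at every batch boundary instead of being carried forward (with gradients truncated at the boundary). A minimal sketch with a toy recurrent cell (hypothetical, not the poster's code) shows that carrying the state reproduces the whole-episode result while resetting it does not:

```python
import math

def rnn_step(h, x, w=0.5, u=0.8):
    # one step of a toy recurrent cell: h' = tanh(w*h + u*x)
    return math.tanh(w * h + u * x)

def run(xs, h0=0.0):
    """Unroll the cell over a sequence, starting from hidden state h0."""
    h = h0
    for x in xs:
        h = rnn_step(h, x)
    return h

episode = [0.1, 0.4, -0.2, 0.3, 0.5, -0.1, 0.2, 0.0]

# Whole-episode pass: state flows through every step.
h_full = run(episode)

# Minibatches with the state re-zeroed each batch (the buggy pattern):
h_reset = 0.0
for chunk in (episode[:4], episode[4:]):
    h_reset = run(chunk, h0=0.0)   # state lost at the boundary

# Minibatches that carry the state forward (truncated BPTT pattern):
h_carried = 0.0
for chunk in (episode[:4], episode[4:]):
    h_carried = run(chunk, h0=h_carried)
```

The same idea applies to a real LSTM: detach the hidden/cell state between batches so gradients are truncated, but pass its value into the next batch; zero it only at true episode boundaries.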
-
I ran Cart_Pole.py with A3C & A2C on Windows and got this error:
Traceback (most recent call last):
File "D:/学习/Deep-Reinforcement-Learning-Algorithms-with-PyTorch-master/results/Cart_Pole.py",…
-
1. Resolve installation and dependency issues explicitly
1-1. Explain which libraries and frameworks we used and how we proceeded
1-2. We must demonstrate the originality of the design: rather than importing a complete implementation, we should only be borrowing functions from an already well-written library.
2. Pin down the theory and equations we applied.
2-1. Currently referring to the following book:…
-
Thank you for the easy-to-use and fast A3C implementation. I created a simple problem for rapid testing that rewards 0 on all steps except the terminal step, where the reward is either -1 or 1. GA3C cann…
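A test environment like the one described can be sketched as follows (hypothetical names, not the GA3C test harness): every intermediate step pays 0, and only the terminal step pays -1 or +1. Such sparse terminal rewards are a good stress test because the value function must propagate a single signal backward through the whole episode.

```python
class SparseRewardEnv:
    """Reward is 0 on every step; only the terminal step pays -1 or +1.

    The agent walks on positions 0..n: reaching n pays +1, reaching 0
    pays -1. Illustrative only.
    """
    def __init__(self, n=5):
        self.n = n
        self.pos = n // 2

    def reset(self):
        self.pos = self.n // 2
        return self.pos

    def step(self, action):        # action: 0 = left, 1 = right
        self.pos += 1 if action == 1 else -1
        if self.pos >= self.n:
            return self.pos, 1.0, True
        if self.pos <= 0:
            return self.pos, -1.0, True
        return self.pos, 0.0, False

env = SparseRewardEnv(n=5)
obs, done, rewards = env.reset(), False, []
while not done:
    obs, r, done = env.step(1)     # always go right -> terminal reward +1
    rewards.append(r)
```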
-
I ran Cart_Pole.py and got an error.
-
Hi Kosuke,
I've tried your model on the Breakout game. The performance was amazing: the average score went up to 520 after 80M steps. It's far better than any other model I've tried.
But the …