a3c Search Results - Githubissues

1000+ results for a3c


mrahtz/tensorflow-rl-pong #1

discount_rewards logic not clear

Hi, I am really stuck on the discount_rewards function. Can you explain the logic behind it? It seems to update the rewards in the forward direction.

parikshitag updated 5 years ago
2
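
For readers with the same question, here is a minimal sketch of how a discounted-return function is commonly written. It is an assumption about the general technique, not the code from mrahtz/tensorflow-rl-pong; the function body, the `gamma` default, and the sample rewards are illustrative. The output list is indexed forward in time, but the loop walks backward so that each step can reuse the return already computed for the step after it.

```python
def discount_rewards(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step t (illustrative sketch)."""
    discounted = [0.0] * len(rewards)
    running = 0.0
    # Walk backward: the return at step t depends on the return at step t + 1.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        discounted[t] = running
    return discounted


# A three-step episode whose only reward arrives at the end.
returns = discount_rewards([0.0, 0.0, 1.0], gamma=0.5)
print(returns)  # [0.25, 0.5, 1.0]
```

Earlier steps receive a smaller share of the final reward, which is why the values grow toward the end of the episode.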
SKKUCS/graduationProject #5

Sharing opinions and news

Google Next Extended Seoul 2018 | Festa! https://festa.io/events/104 [The Google Next Extended Seoul event is a place to share and learn about various Google Cloud Platform use cases, including big data, machine learning, databases, and serverless.] ...or so they say. This weekend…

WinteringAsch updated 5 years ago
15
arXivTimes/arXivTimes #642

IMPALA: Scalable Distributed Deep-RL with Importance Weighte…

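## In a nutshell: Research on large-scale distributed training for reinforcement learning. In A3C each agent sends gradients to a central server, whereas in the proposed method (IMPALA) the experience (state/action/reward) is sent as-is to the center (Learner), where learning takes place. The edge agents therefore learn off-policy, and the authors propose a technique called V-trace to assign an importance weight to each experience. ![image](https://user-i…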

icoxfog417 updated 6 years ago
1
minosworld/minos #48

benchmark code of four navigation algorithms

Hi, will the benchmark code of the four navigation algorithms in the paper be released? Also, how long does it take to train the agents? My English is poor, so I have some confusion about the followin…

nina124 updated 6 years ago
6
tiny-dnn/tiny-dnn #861

Add support for stateful loss functions

I'm playing around with reinforcement learning, and in this case, loss value should only be calculated for one output, and be zero for the rest. It's not much of a problem in most cases, as desired_ou…

MaxSavenkov updated 7 years ago
4
chris-chris/pysc2-examples #2

It seems dqn can't learn much

I ran the script last night. It started with ~11 mean reward, and ended with ~15.5 mean reward. I tried to play this mini-game myself, and I could get ~100 score or more. DeepMind reached ~100 score…

Seraphli updated 6 years ago
15
pytorch/pytorch #30388

Increasing memory usage on CPU

## 🐛 Bug I am trying to run a small neural network on the CPU and am finding that the memory used by my script increases without limit. Since my script does not do much besides call the network, th…

varun-intel updated 3 years ago
8
GeminiLight/hrl-acra #5

Test results, the acceptance rate is always 1

![Screenshot 2024-06-03 112232](https://github.com/GeminiLight/hrl-acra/assets/147413930/ad2dd4b2-b7cb-4dbd-9da5-45782bede298) After I trained the model and ran several rounds of tests, I found that th…

lpj12121 updated 3 months ago
4
benelot/pybullet-gym #10

HumanoidBulletEnv-v0 crashing in multiple RL frameworks

Hello. I have been trying to train an agent in `HumanoidBulletEnv-v0`. I have tried using multiple frameworks and algorithms, but have not been able to obtain a good policy in this particular environ…

ycps updated 5 years ago
5
pytorch/pytorch #54874

Get a thread safe copy of torch::nn::Sequential object

I need to get a copy of `shared` neural network of type `torch::nn::Sequential`. It seems that there is no available API for this purpose at the moment. It seems that declaring and instantiating the n…

oroojlooy updated 3 years ago
6
