a3c Search Results - Githubissues

1000+ results
for a3c

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MegaMek/mekhq #2377

RFE: Assign unit to force from Hanger screen

### Environment MekHQ 0.47.16 Java version 15.0.1 Platform Mac OS X 10.15.7 (x86_64) ### Description When you start getting into dozens of units and a handful of forces, it can be desirable…

mjbroekman updated 3 years ago
2
Itsukara/async_deep_reinforce #3

Experiment Results

This thread is used for sharing experiment results. I'd appreciate if you could write your experiment result to this thread when you try my code. The following messages are sample reports.

Itsukara updated 8 years ago
6
Kismuz/btgym #124

Overestimated Value Function in Actor Critic Framework

@Kismuz, I believe I have encountered a framework (A3C) limitation. While training a few of my recent models I noticed a strange behavior. For the first part of training everything seems to work fi…

JaCoderX updated 4 years ago
7
hongzimao/pensieve #68

conv2d in multi_video_sim

jlabhishek updated 5 years ago
2
xinleipan/VirtualtoReal-RL #3

How to use this TORCS in reinforcement learning ?

I notice that image is available through shared memory with C/C++. But for reinforcement learning, I also need to send control instruction to it. What's more, it's better to get image via python, sin…

XiaoZzai updated 5 years ago
11
e4exp/paper_manager_abstract #357

Deep Reinforcement Learning for Programming Language Correct…

- https://arxiv.org/abs/1801.10467 - 2018 初心者のプログラマーは、プログラミング言語の形式的な構文に悩まされることが多い。そこで我々は、強化学習が可能な新しいプログラミング言語修正フレームワークを設計した。このフレームワークでは、エージェントがテキストのナビゲーションと編集のために人間の動作を模倣することができる。本研究では、プログラミ…

e4exp updated 3 years ago
3
apache/mxnet #18280

[Performance Regression] GPU memory increase for training an…

## Description - There is an MXNet nightly benchmark which runs CV and NLP models on MXNet Nightly pip wheel and report the metrics and it showed a performance regression on GPU Memory. - After bise…

karan6181 updated 4 years ago
11
yr4000/Slither-ML-bot #1

Parameters

Hi, Project is missing "parameters" folder, could you add it, please? :)

Dimensionic updated 7 years ago
7
mobeets/q-rnn #20

Beron with KL penalty

todo: add KL penalty between current and marginal policy as an intrinsic reward/penalty log π(a|s)/p(a) the question is if this will induce perseveration the only thing to figure out is how to …

mobeets updated 7 months ago
7
datawhalechina/easy-rl #40

/chapter7/chapter7

https://datawhalechina.github.io/easy-rl/#/chapter7/chapter7 Description

qiwang067 updated 9 months ago
11

上一页 1...28 29 30 31 32 33 34...100 下一页

1000+ results for a3c

1000+ results
for a3c