-
From the 311 self-play games I have generated so far, I counted 157 draws, 139 white wins, and 15 black wins. This strikes me as odd, considering that the first-move advantage should be worth very little for a…
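For reference, white's share of the decisive games follows directly from these counts:

```python
# Summarizing the reported self-play results (numbers from the post above).
draws, white_wins, black_wins = 157, 139, 15
total = draws + white_wins + black_wins        # 311 games
decisive = white_wins + black_wins             # 154 decisive games
white_share = white_wins / decisive            # white's share of decisive games
draw_rate = draws / total

print(f"white share of decisive games: {white_share:.1%}")  # ~90.3%
print(f"draw rate: {draw_rate:.1%}")                        # ~50.5%
```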
-
I have been trying to implement the [A3C algorithm](https://arxiv.org/abs/1602.01783). However, I have found that it is impossible to create an ndarray that is shared among different processes.
Howev…
-
It doesn't seem like custom gradients like this can be built automatically, but it is a nice idea. Could the log-likelihood trick be implemented?
http://arxiv.org/pdf/1506.05254v3.pdf
http://blog.shakirm…
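As a hand-rolled sketch of what the score-function (log-likelihood) trick estimates — this is illustrative Monte Carlo code, not an existing autodiff feature:

```python
# Sketch: score-function (REINFORCE / log-likelihood) estimator for
# d/dtheta E_{x~Bernoulli(theta)}[f(x)], which needs no gradient
# through the sampling step. Sample count and seed are illustrative.
import random

def score_function_grad(theta, f, n_samples=200000, seed=0):
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        x = 1.0 if rng.random() < theta else 0.0
        # score = d/dtheta log p(x; theta) for a Bernoulli
        score = x / theta - (1.0 - x) / (1.0 - theta)
        total += f(x) * score
    return total / n_samples

# With f(x) = x, E[f(x)] = theta, so the true gradient is 1.
est = score_function_grad(0.3, lambda x: x)
print(round(est, 2))  # ≈ 1.0, up to Monte Carlo noise
```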
-
Hi,
Unless the goal is to not support TensorFlow with GPU, I would recommend moving the tensorflow requirement to `extras_require`. I have seen this pattern in both sonnet and tensor2tensor.
…
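A minimal sketch of that setup.py pattern, assuming an illustrative package name and version pins:

```python
from setuptools import setup, find_packages

# Sketch of the extras_require pattern (names and pins illustrative).
# A plain `pip install mypkg` installs no TensorFlow; users pick one of:
#   pip install mypkg[tensorflow]      -> CPU build
#   pip install mypkg[tensorflow_gpu]  -> GPU build
setup(
    name="mypkg",
    version="0.1.0",
    packages=find_packages(),
    extras_require={
        "tensorflow": ["tensorflow>=1.8.0"],
        "tensorflow_gpu": ["tensorflow-gpu>=1.8.0"],
    },
)
```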
-
Peeking queues would be a useful addition for multi-threaded TensorFlow applications. Right now there is no way to look at the first element without affecting the queue. We could empty a `tf.FIFOQueue…
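To pin down the requested semantics, here is a plain-Python sketch of a peekable FIFO — this is not a tf.FIFOQueue API, just an illustration of non-destructive `peek`:

```python
import threading

class PeekableQueue:
    """Sketch: a thread-safe FIFO with a non-destructive peek().

    Illustrates the semantics requested for tf.FIFOQueue; this is
    plain Python, not a TensorFlow op.
    """

    def __init__(self):
        self._items = []  # underlying FIFO storage
        self._not_empty = threading.Condition()

    def put(self, item):
        with self._not_empty:
            self._items.append(item)
            self._not_empty.notify()

    def get(self):
        with self._not_empty:
            while not self._items:
                self._not_empty.wait()
            return self._items.pop(0)

    def peek(self):
        # Look at the head element without removing it.
        with self._not_empty:
            while not self._items:
                self._not_empty.wait()
            return self._items[0]

q = PeekableQueue()
q.put("a"); q.put("b")
print(q.peek())  # 'a' -- still in the queue
print(q.get())   # 'a'
print(q.peek())  # 'b'
```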
-
Apparently current LZ doesn't have any good idea about counting, and that has been the reason for disabling resignation in self-play games. A few questions regarding this:
1. When will be the point…
-
@fchollet Since we're not adding much to the repo at this stage (in terms of layers, loss functions, callbacks, etc.), we've talked quite a bit about an external repo for user/additional contributions…
-
Have you trained Breakout with your A3C by any chance? I wonder what kind of scores you have gotten.
John
-
http://arxiv.org/pdf/1602.01783v1.pdf describes asynchronous methods using off-policy (1-step / n-step Q-learning) and even on-policy (Sarsa and advantage actor-critic (A3C)) reinforcement learning.
T…
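As a sketch of the n-step bootstrapped return these methods compute for a rollout (pure Python, values illustrative):

```python
# Sketch: n-step return R_t = r_t + g*r_{t+1} + ... + g^{n-1}*r_{t+n-1} + g^n * V(s_{t+n}),
# computed backwards over a rollout, bootstrapping from the last state's value.
def n_step_return(rewards, bootstrap_value, gamma):
    """Return the n-step return for every step of the rollout."""
    R = bootstrap_value
    returns = []
    for r in reversed(rewards):
        R = r + gamma * R
        returns.append(R)
    return list(reversed(returns))

print(n_step_return([1.0, 0.0, 1.0], bootstrap_value=0.5, gamma=0.9))
# ≈ [2.17, 1.31, 1.45]
```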
-
Hi Ran,
Thanks again for creating qtpylib; it is great.
I am seeing some strange behaviour: if I create a fresh Anaconda environment, install qtpylib there (with `pip install qtpylib --upgrade --no-cache-…