deep-rl Search Results - Githubissues

1000+ results
for deep-rl


HumanCompatibleAI/imitation #696

Support human preferences in “Deep RL from human preferences…

Our team [KABasalt](https://github.com/BASALT-2022-Karlsruhe) participated in last year's BASALT competition and we noticed that RLHP currently lacks support for human preferences. ## Problem: On…

mschweizer updated 1 year ago
4
broadinstitute/AutoTrain #6

Advanced Prototype - AutoTrain

Interesting Resources: - [RL Curriculum Learning](https://lilianweng.github.io/lil-log/2020/01/29/curriculum-for-reinforcement-learning.html) - [meta-RL](https://lilianweng.github.io/lil-log/2019/…

ctrlnomad updated 3 years ago
2
Farama-Foundation/Gymnasium #28

[Proposal] Tutorials

### Proposal To encourage the use of Gymnasium and build up the RL community, I would propose that a large range of tutorials are created. This is a list of tutorials that could be made - [x…

pseudo-rnd-thoughts updated 4 months ago
13
flow-project/flow #96

[documentation] unify descriptions of Flow

From our website: > Flow: a deep reinforcement learning framework for mixed-autonomy traffic > > Flow leverages state-of-the-art deep RL libraries and the open-source microsimulator, SUMO, enabli…

cathywu updated 5 years ago
3
arXivTimes/arXivTimes #642

IMPALA: Scalable Distributed Deep-RL with Importance Weighte…

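## In a nutshell: Research on large-scale distributed training for reinforcement learning. In A3C, each agent sends gradients to a central server, whereas the proposed method (IMPALA) sends experience (state/action/reward) as-is to a central Learner, which does the learning. The edge agents therefore learn off-policy, so the authors propose V-trace, a method for assigning an importance weight to each experience ![image](https://user-i…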

icoxfog417 updated 6 years ago
1
ll7/robot_sf_ll7 #11

Add a training scenario which uses wandb as experiment track…

ll7 updated 3 months ago
8
Kredaro/Deep-Reinforcement-Learning #2

Need research on efforts of using Deep RL in different areas…

Need research and documentation of major efforts in using Deep Reinforcement Learning in areas like education, health care and energy. Assigning @rithesh17 .

hackintoshrao updated 5 years ago
1
number9473/nn-algorithm #261

IMPALA: Scalable Distributed Deep-RL with Importance Weighte…

# IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures # - Author: Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Dor…

joyhuang9473 updated 6 years ago
1
IBM/pytorch-seq2seq #169

RuntimeError occurs running integration_test.py

**RuntimeError** occurs when I run python script *integration_test.py*. I did not modify any code, just installed *pytorch-seq2seq* and ran the script. Trying to find out how to run the script w…

jhyun0919 updated 5 years ago
1
rlberry-py/rlberry #325

User guide

I propose we do a user guide for rlberry. The outline of which would be something like this: * Installation * Basic Usage * Quick Start RL * Quick Start Deep RL * Set up of an experiment …

TimotheeMathieu updated 9 months ago
3
