deep-rl Search Results - Githubissues

1000+ results
for deep-rl


liuyuemaicha/Deep-Reinforcement-Learning-for-Dialogue-Generation-in-tensorflow #12

Are you still updating the code?

Thank you, your code has helped me a lot. Sorry, but I ran into some problems with the code that I can't solve. Looking forward to your update!

yjyGo updated 5 years ago
2
wouterkool/attention-learn-to-route #53

Reimplementation in RL platform (CleanRL)

Hello there, my team has been trying to implement the Attention Model in RL platforms so that we can try out different RL algorithms. Eventually, we succeeded in implementing the most efficient one with PP…

cpwan updated 8 months ago
2
justheuristic/mariewelt-mdp-med #2

Try to hack it together

As a first step, it would be great to get better acquainted with deep RL. - [x] choose an environment with not-too-frequent rewards that can at least somehow be solved as an MDP, e.g. box2d/LunarLander | atari/berze…

justheuristic updated 8 years ago
1
FragileTech/FractalAI #29

[Suggestion- Extending Results] FAI as alternative uses-case…

- One of the big results in ["Learning to Plan Chemical Syntheses"](https://arxiv.org/abs/1708.04202) or ["Towards "AlphaChem": Chemical Synthesis Planning with Tree Search and Deep Neural Network Pol…

0bserver07 updated 6 years ago
3
MOCR/DDPG #3

DDPG Actor output saturate

Hello, I have a question about DDPG. When my action dimension = 1, the result is good, but when my action dimension = 2 (the activation functions are tanh and sigmoid), the output of the actor will satur…

m5823779 updated 5 years ago
1
ALRhub/deep_rl_for_swarms #1

Readme

Please update the README to make the code more understandable. Also, can I implement these for a single pursuer and a single evader? If yes, please briefly outline the steps. Thanks

vsd550 updated 5 years ago
2
pentium3/sys_reading #216

FIRM: An Intelligent Fine-grained Resource Management Framew…

https://www.usenix.org/conference/osdi20/presentation/qiu

pentium3 updated 2 weeks ago
1
keras-team/keras #19776

Loading model fails: can only concatenate tuple

Hi, I'm trying to save and load the model from this example: https://keras.io/examples/rl/deep_q_network_breakout/ Saving the model works. When I load the model I'm getting the following error: `…

sebplorenz updated 1 month ago
1
virtualmlnet/hackathon-2021 #6

.NET Reinforcement learning (e.g. OpenAI Gym) using Godot a…

## Hackathon Idea A Godot template project for [OpenAI Gym](https://github.com/openai/gym) using only .NET machine learning framework(s) ### Your name - Jim (aka GeorgeFFM at Discord) - She…

GeorgeS2019 updated 2 years ago
13
number9473/nn-algorithm #250

Playing Atari with Deep Reinforcement Learning

# Playing Atari with Deep Reinforcement Learning # - Author: Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller - Origin: https://ar…

joyhuang9473 updated 6 years ago
1
