-
Thank you, your code has helped me a lot. Unfortunately, I ran into some problems with the code that I can't solve. Looking forward to your update!
yjyGo updated
5 years ago
-
Hello there, my team has been trying to implement the Attention Model in RL platforms so that we can try out different RL algorithms. Eventually, we succeeded in implementing the most efficient one with PP…
cpwan updated
8 months ago
-
As a first step, it would be great to get better acquainted with deep RL.
- [x] choose an environment with fairly infrequent rewards that can be solved, at least to some degree, as an MDP
- for example, box2d/LunarLander | atari/berze…
-
- One of the big results in ["Learning to Plan Chemical Syntheses"](https://arxiv.org/abs/1708.04202) or ["Towards "AlphaChem": Chemical Synthesis Planning with Tree Search and Deep Neural Network Pol…
-
Hello, I have a question about DDPG.
When my action dimension is 1, the results are good, but when it is 2 (the activation functions are tanh and sigmoid), the output of the actor will satur…
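For readers following this question, here is a minimal sketch of how tanh and sigmoid outputs flatten out when their pre-activations grow large, which also drives the tanh gradient toward zero. The function names (`actor_head`, `tanh_grad`) and the input values are hypothetical and chosen only for illustration; they are not taken from the issue or from any particular DDPG implementation.

```python
import math


def actor_head(pre_activations):
    """Hypothetical 2-D actor head: tanh on the first output, sigmoid on the second."""
    z0, z1 = pre_activations
    return math.tanh(z0), 1.0 / (1.0 + math.exp(-z1))


def tanh_grad(z):
    """Derivative of tanh at z: 1 - tanh(z)^2."""
    return 1.0 - math.tanh(z) ** 2


# Illustrative pre-activation values (assumptions, not measured data).
small = actor_head((0.5, -0.5))   # outputs stay in the responsive range
large = actor_head((8.0, -8.0))   # outputs are pinned near the bounds

print("small pre-activations:", small, "tanh grad:", tanh_grad(0.5))
print("large pre-activations:", large, "tanh grad:", tanh_grad(8.0))
```

Logging the pre-activation magnitudes of the actor's last layer during training is one way to check whether this effect is present.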
-
Please update the README to make the code easier to understand.
Also, can I implement this for a single pursuer and a single evader? If so, please briefly outline the steps.
Thanks
-
https://www.usenix.org/conference/osdi20/presentation/qiu
-
Hi,
I'm trying to save and load the model from this example: https://keras.io/examples/rl/deep_q_network_breakout/
Saving the model works, but when I load it I get the following error:
`…
-
## Hackathon Idea
A Godot template project for [OpenAI Gym](https://github.com/openai/gym) using only .NET machine learning framework(s)
### Your name
- Jim (aka GeorgeFFM at Discord)
- She…
-
# Playing Atari with Deep Reinforcement Learning #
- Authors: Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller
- Origin: https://ar…