-
# Human-level control through deep reinforcement learning #
- Author: Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller…
-
I want to make a project using reinforcement learning in which a bot send scam to other bots on social media, other bots detect the scam and reject it.
I think it needs a deep reinforcement learning…
-
Very interesting project. I've been recently trying to test it with 3D images, but I didn't have much progress. I believe it has to do with the shape of the output of the Policy network being pout.sa…
-
Hi, I am new to Tensorflow and interested in running this project.
But I don't see test descriptions in your readme wiki. Could you please give a description how to test the model?
Thanks a lot.
-
Em dang dung ubuntu 22.04, va tai ibusbamboo nhung khong dung duoc tren chrome.
-
Dear Mr.hongzi
I was interested in your resource scheduling method. Now, I stuck in your network class. I can't understand why you used the blow function:
`loss = T.log(prob_act[T.arange(N), actions…
-
I'm just starting to learn about reinforcement-learning and I found that this is a great resource, but I notice the answer on dp policy evaluation could possibly be misleading.
the answer update ea…
-
### Current Behavior
If the reference sequence provided to the `augur translate` command has invalid characters in a gene name (e.g. spaces), this will eventually lead to an error during `augur expor…
-
Hi guys,
I'm using your PySocialForce package to model a robot / pedestrian interaction in a 2D world. It's really a great effort from your side to create this package and put it on PyPI. Unfortuna…
-
Hello,
I have added more number of RL vehicles in the example "cooperative merge" distributively. But I have noticed that, my agents which I have added, does not accelerate much when compared with 'h…
pnp91 updated
5 years ago