-
Hello, I hope you're doing well,
I have been attempting to replicate BoB's publication results, and although on the surface I appear to be getting some of them, after digging deeper and doing my …
-
Thanks for the nice code. I am trying to reproduce the result on "Pendulum-V0" using a3c_cont.py, but it seems the model fails to converge. I have tried various methods like experience replay, but still n…
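For context, here is a minimal sketch of the uniform experience replay buffer the comment mentions, in plain Python; it illustrates the technique, not code from a3c_cont.py (and note that vanilla A3C is on-policy, so naively replaying old transitions can itself hurt convergence).

```python
# A minimal sketch of a uniform experience replay buffer, plain Python;
# an illustration of the technique mentioned above, not code from
# a3c_cont.py.
import random
from collections import deque

class ReplayBuffer:
    def __init__(self, capacity=100_000):
        self.buffer = deque(maxlen=capacity)  # oldest transitions are evicted

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random minibatch over stored transitions.
        batch = random.sample(self.buffer, batch_size)
        states, actions, rewards, next_states, dones = zip(*batch)
        return states, actions, rewards, next_states, dones

    def __len__(self):
        return len(self.buffer)
```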
-
I came across this repo via the DJL repo, and you might be interested in the ONNX export functionality we built into Tribuo's latest release (v4.2.0). We have a separate module (which only depends on pro…
-
CURL: Contrastive Unsupervised Representations for Reinforcement Learning, [paper](https://proceedings.icml.cc/static/paper_files/icml/2020/5951-Paper.pdf).
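CURL's objective is an InfoNCE loss over two augmented views of the same observation, scored with a bilinear similarity; a minimal sketch assuming PyTorch follows, where the function name and shapes are illustrative rather than the authors' reference implementation.

```python
# A minimal sketch of CURL's InfoNCE loss, assuming PyTorch; q comes from
# the query encoder, k from the momentum (key) encoder, and W is a learned
# bilinear matrix. Names and shapes are illustrative, not the paper's code.
import torch
import torch.nn.functional as F

def curl_infonce_loss(q, k, W):
    """q, k: (batch, z_dim) embeddings of two augmentations; W: (z_dim, z_dim)."""
    k = k.detach()                          # no gradient through the key encoder
    logits = q @ W @ k.t()                  # bilinear similarity, (batch, batch)
    logits = logits - logits.max(dim=1, keepdim=True).values  # numerical stability
    labels = torch.arange(q.size(0), device=q.device)  # positives on the diagonal
    return F.cross_entropy(logits, labels)
```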
-
Hey there,
I also use MCTS to predict good actions. However, in my case (a multiplayer card game) it is very expensive to look ahead very far. For this reason, I want to ask whether you know if there is …
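One common way to keep MCTS lookahead affordable is to truncate rollouts at a fixed depth and back off to a heuristic evaluation; a minimal sketch follows, where the node/state interfaces (`children`, `visits`, `step`, `evaluate`, …) are hypothetical stand-ins, not code from the repo under discussion.

```python
# A minimal sketch of UCB1 child selection plus a depth-limited rollout;
# the node/state interfaces used here are hypothetical placeholders.
import math
import random

def ucb1_select(node, c=1.4):
    # Pick the child maximizing the UCB1 score (exploitation + exploration).
    return max(
        node.children,
        key=lambda ch: ch.value / (ch.visits + 1e-9)
        + c * math.sqrt(math.log(node.visits + 1) / (ch.visits + 1e-9)),
    )

def depth_limited_rollout(state, step, evaluate, max_depth=10):
    # Play random moves for at most max_depth steps, then substitute a
    # heuristic evaluation instead of simulating to the end of the game.
    for _ in range(max_depth):
        if state.is_terminal():
            return state.reward()
        state = step(state, random.choice(state.legal_actions()))
    return evaluate(state)
```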
-
Please add an option to disable this.
-
Pose a question about one of the following articles:
“[Human-level control through deep reinforcement learning](https://www.nature.com/articles/nature14236)” 2015. V. Mnih...D. Hassabis. Nature 51…
-
### Is your proposal related to a problem?
We would like to have the ability to ask "follow-up questions" based on the answers to previous questions in the booking request. For example, we could h…
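To make the request concrete, here is one possible shape for such conditional follow-ups as data, sketched in Python; every field name is invented for illustration and is not the project's actual schema.

```python
# A hypothetical schema for follow-up questions; none of these field names
# come from the project, they only illustrate the proposal.
booking_questions = [
    {
        "id": "bringing_equipment",
        "prompt": "Will you bring your own equipment?",
        "type": "yes_no",
        "follow_ups": [
            {
                "if_answer": "yes",  # shown only when the parent answer matches
                "id": "equipment_list",
                "prompt": "Please list the equipment you will bring.",
                "type": "free_text",
            }
        ],
    }
]
```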
-
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair, Bob McGrew, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel
8 pages, ICRA 2018
https://arxiv.org/abs/1709.10089
-
Hi,
Thank you for your dedicated work on PCC-Uspace.
When I followed the instructions in Deep_Learning_Readme.md, I found that the values of both Reward and Ewma Reward were as high as in the snapshot…