-
Post your question here about the orienting readings: “Reinforcement Learning” and “Deep Reinforcement Learning”, Thinking with Deep Learning, Chapters 15 & 16.
lkcao updated
2 years ago
-
Hi there, I get running error when trying to run an agent, any tips on solving it?
Traceback (most recent call last):
File "play.py", line 20, in
Game.fit_model()
File "/Users/maciejwia…
-
(From [here](https://github.com/fpadula/visualcollisionarm/commit/63c7b2aff3ae589b81df2a82aece2af0d89c47b2#commitcomment-139765127))
Hi @Hokite, please open an issue ticket next time. The learning …
-
Hello, is it possible to provide the paper corresponding to the code, thank you very much!
-
Reinforcement learning techniques seem relevant to the bandit approach.
https://scikit-learn.org/stable/faq.html#why-is-there-no-support-for-deep-or-reinforcement-learning-will-there-be-support-for…
-
From our website:
> Flow: a deep reinforcement learning framework for mixed-autonomy traffic
>
> Flow leverages state-of-the-art deep RL libraries and the open-source microsimulator, SUMO, enabli…
-
https://github.com/dotnet/TorchSharp
https://github.com/dotnet/TorchSharp/discussions/334#discussioncomment-1258501
Apply the Deep-MAT-Deformation using Torchsharp on Godot (TorchSharp works on …
-
New environments to be created based on:
* [Direct Shape Optimization through Deep Reinforcement Learning](https://arxiv.org/pdf/1908.09885.pdf)
* [Aerodynamic Shape Optimization using a Novel Opt…
-
- https://arxiv.org/abs/2103.05187
- 2021
本論文では,提案不要の参照表現接地タスクに取り組み,既製のオブジェクト提案に頼らずに,クエリ文に応じてターゲットオブジェクトをローカライズすることを目指す.
既存の提案不要の手法は、クエリと画像のマッチングを行い、画像特徴マップの中で最もスコアの高い点をターゲットボックスの中心として選択し、その幅と高さを…
e4exp updated
2 years ago
-
## 一言でいうと
Model Baseの手法で学習を行う際に、環境全体をモデル化するのでなく、局所的なパートだけモデル化して(このとき戦略も線形化する)、戦略の勾配を推定するという手法。これにより環境全体をモデル化する必要なしにModel Baseによる効率的な学習が可能になる。
![image](https://user-images.githubusercontent.com/5…