deep-reinforcement-learning Search Results

huypham37/AIML14 #17

Explore and Document algorithms to use in Phase 2 and 3

huypham37 updated 1 month ago

reiniscimurs/DRL-robot-navigation #155

Effect of eval_freq

Dear Reinis Cimurs, I recently read your essay titled "Goal-Driven Autonomous Exploration Through Deep Reinforcement Learning"，I think your paper is fantastic and having watched your videos on youtub…

zyw0319 updated 3 weeks ago

Thinking-with-Deep-Learning-Spring-2022/Readings-Responses #8

Reinforcement Learning -Orientation

Post your question here about the orienting readings: “Reinforcement Learning” and “Deep Reinforcement Learning”, Thinking with Deep Learning, Chapters 15 & 16.

lkcao updated 2 years ago

citation-style-language/styles #7187

Is UWE Harvard style broken?

I am currently trying to generate two citations using the UWE Bristol Harvard format. At the moment when I do it I get: ``` Fontanella, C. (2022) A Beginner's Guide to Customer Behavior Analysis:d…

kamransoomro84 updated 2 months ago

e4exp/paper_manager_abstract #564

Iterative Shrinking for Referring Expression Grounding Using…

- https://arxiv.org/abs/2103.05187 - 2021 本論文では，提案不要の参照表現接地タスクに取り組み，既製のオブジェクト提案に頼らずに，クエリ文に応じてターゲットオブジェクトをローカライズすることを目指す．既存の提案不要の手法は、クエリと画像のマッチングを行い、画像特徴マップの中で最もスコアの高い点をターゲットボックスの中心として選択し、その幅と高さを…

e4exp updated 3 years ago

dancrew32/ab #8

Research reinforcement learning

Reinforcement learning techniques seem relevant to the bandit approach. https://scikit-learn.org/stable/faq.html#why-is-there-no-support-for-deep-or-reinforcement-learning-will-there-be-support-for…

dancrew32 updated 5 years ago

rll/rllab #170

A script to reproduce "Benchmarking Deep Reinforcement Learn…

Do you provide an official ready-to-run script to reproduce the exact results of the "Benchmarking Deep Reinforcement Learning for Continuous Control" paper?

wjaskowski updated 7 years ago

jfmartinz/ResourceHub #393

💡 [FEATURE] - Adding proper sub topics for Machine Learning.

### Idea Contribution - [X] I have read all the feature request issues. - [X] I'm interested in working on this issue - [X] I'm part of GSSOC organization ### Explain feature request Adding proper …

karishmaaa101 updated 6 months ago

arXivTimes/arXivTimes #910

SOLAR: Deep Structured Latent Representations for Model-Base…

## 一言でいうと Model Baseの手法で学習を行う際に、環境全体をモデル化するのでなく、局所的なパートだけモデル化して(このとき戦略も線形化する)、戦略の勾配を推定するという手法。これにより環境全体をモデル化する必要なしにModel Baseによる効率的な学習が可能になる。 ![image](https://user-images.githubusercontent.com/5…

icoxfog417 updated 5 years ago

arXivTimes/arXivTimes #1173

Playing Text-Adventure Games with Graph-Based Deep Reinforce…

## 一言でいうとテキストアドベンチャーゲームを、強化学習＋知識グラフで攻略したという研究。ゲームは選択肢で分岐して進むため、得られた状態で内部のグラフを更新していく。グラフ表現(Graph Convolution + Attention)+テキスト表現(一定範囲のBi-LSTM)で行動価値を出力する(行動数はグラフで絞り込む)。 ### 論文リンク https://arxiv…

icoxfog417 updated 5 years ago

1000+ results for deep-reinforcement-learning

1000+ results
for deep-reinforcement-learning