-
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair, Bob McGrew, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel
8 pages, ICRA 2018
https://arxiv.org/abs/1709.10089
-
-
Go to the `docs/source/usage/tutorials` directory and add separate `.md` files to explain the following:
- [x] Using A2C (@Darshan-ko )
- [ ] Using PPO1
- [x] Using VPG (@Devanshu24 )
- [ ] Using DQN(s)
- …
-
## Keyword: sgd
There is no result
## Keyword: optimization
### A Model-Constrained Tangent Manifold Learning Approach for Dynamical Systems
- **Authors:** Hai Van Nguyen, Tan Bui-Thanh
-…
-
Hi Denny,
Thanks for this wonderful resource. It's been hugely helpful. Can you say what your results are when training the DQN solution? I've been unable to reproduce the results of the DeepMind p…
-
We need to convert keras.io examples to work with Keras 3.
This involves two stages:
## Stage 1: tf.keras backwards compatibility check
Keras 3 is intended as a drop-in replacement for tf.ker…
-
Post a link for a "possibility" reading of your own on the topic of Reinforcement Learning [for week 8], accompanied by a 300-400 word reflection that: 1) briefly summarizes the article (e.g., as we d…
-
## In one sentence
Achieves state of the art on the Atari 2600 benchmark by combining the DQN improvements introduced so far.
### Paper link
https://arxiv.org/pdf/1710.02298.pdf
### Authors / Affiliations
Matteo Hessel/DeepMind
Joseph Modayil/DeepMind
Hado va…
-
Whereas NumPy correctly uses the `int64` dtype, Aesara doesn't until it's told to:
```python
>>> np.array(- (2**32))
array(-4294967296, dtype=int64)
>>> at.constant(- (2**32))
TensorConstant{…
```
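As a quick check of the NumPy side of this comparison, the dtype inference described above can be verified directly (a minimal sketch using only NumPy, not Aesara):

```python
import numpy as np

# A Python int that does not fit in 32 bits is inferred as int64 by NumPy.
x = np.array(-(2**32))
print(x.dtype)   # int64
print(x.item())  # -4294967296
```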
-
### 🐛 Bug
I get a device mismatch when attempting to use PPO with a multi-input dict observation.
This was when calling:
```python
with torch.no_grad():
    actions = myppo.policy._predict(inp_dict, de…
```
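A minimal sketch of one common workaround, assuming the policy exposes a Stable-Baselines3-style `device` attribute (the helper name `to_device` is hypothetical): move every tensor in the dict observation onto the policy's device before calling `_predict`.

```python
import torch

def to_device(obs_dict, device):
    """Move every tensor in a dict observation onto `device` (hypothetical helper)."""
    return {k: v.to(device) for k, v in obs_dict.items()}

# Stand-in dict observation; in the report this would come from the environment.
inp_dict = {"image": torch.zeros(1, 3, 8, 8), "state": torch.zeros(1, 4)}
device = torch.device("cpu")  # stand-in for myppo.policy.device
moved = to_device(inp_dict, device)
print(all(v.device == device for v in moved.values()))  # True
```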