guided-policy-search Search Results

416 results for guided-policy-search


mabonki0725/MLandRobotic #1

Guided Policy Search

This document describes a robot control model for complex trajectories that combines machine learning, deep learning, and modern control theory. https://arxiv.org/abs/1504.00702

mabonki0725 updated 8 years ago
1
arXivTimes/arXivTimes #1165

Guided Meta-Policy Search

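## In a nutshell: A study on meta-learning in reinforcement learning that uses imitation learning to shorten training time on the "meta" side. Obtaining an initialization that can adapt to a variety of tasks requires sufficient meta-learning across multiple tasks, which is especially time-consuming in the on-policy case. The paper therefore proposes a method that learns faster by combining imitation learning with off-policy learning. ### Paper link https://arxiv.org/abs/1904…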

icoxfog417 updated 5 years ago
2
MillionIntegrals/vel #21

Implementation of Guided Policy Search

Can you implement the Guided Policy Search algorithm as described here (https://papers.nips.cc/paper/5444-learning-neural-network-policies-with-guided-policy-search-under-unknown-dynamics.pdf)? There isn'…

kabirahuja2431 updated 4 years ago
7
mabonki0725/MLandRobotic #3

Guided Policy Search with Transfer

This paper presents a transfer model for Guided Policy Search. https://arxiv.org/abs/1609.07088

mabonki0725 updated 8 years ago
1
mabonki0725/MLandRobotic #4

Guided Policy search with memory

This paper is about assisting Guided Policy Search with memory implemented by an RNN. https://arxiv.org/abs/1507.01273

mabonki0725 updated 8 years ago
1
siemanko/guided-policy-search #1

Bugs in temporal_multi_step_policy_model

File "~/guided-policy-search/guided/tmodel/temporal_multi_step_policy_model.py", line 75, in create_misc_functions hiddens = [T.vector() for i in range(self._mlp.params)] TypeError: 'list' object …

qazmichaelgw updated 8 years ago
3
hanyas/trajopt #3

Lack of python package: "mimo"...

Hi, developer. I am glad you provide this repo for us to learn about Guided Policy Search. After following the README and installing this package successfully, I tried to run example/gps/mf_lqr.py. But …

CodingCatMountain updated 5 months ago
1
kimhc6028/policy-gradient-importance-sampling #1

Hi, is there a referenced paper?

Hi, is there a referenced paper?

ruizhaogit updated 6 years ago
2
vllm-project/vllm #10429

[Bug]: rocm issue

### Your current environment AMD radon + kubernetes ### Model Input Dumps `vllm serve mistralai/Mistral-7B-Instruct-v0.3 --trust-remote-code --enable-chunked-prefill --max_num_batch…

YYXLN updated 3 hours ago
1
Toto-0/Robotics2-project #1

A problem with train.py in ocs2_ballbot_mpcnet of ocs2

Sorry to bother you, I just started learning OCS2 and encountered an issue while running train.py in ocs2_ballbot_mpcnet: it always gets stuck at "waiting for the first data". I found your question in th…

Knight-xiao updated 7 months ago
2
