safe-reinforcement-learning Search Results

324 results
for safe-reinforcement-learning


PKU-Alignment/omnisafe #358

[Question] A question about the cost function of the p3o alg…

### Required prerequisites - [X] I have read the documentation . - [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/omnisafe/issues) and [Discussions](https://github.com/PKU-A…

Liqinyan821 updated 3 days ago
1
kgex/developer-roadmap #514

Add Safe Reinforcement Learning resource

DineshkumarS05 updated 1 year ago
1
arXivTimes/arXivTimes #24

Safe and efficient off-policy reinforcement learning

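## In a nutshell Proposes a safe and efficient algorithm for return-based off-policy reinforcement learning. "Safe" means that performance is robust to how far off-policy the data is. "Efficient" means that learning is sample-efficient. The paper gives convergence guarantees and experiments. It was accepted at NIPS 2016, and reading the analysis carefully looks tough. ### Paper link https://arxiv.org/abs/1606.02647 ### Authors…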

alb-ktaka updated 6 years ago
1
modumarl/proposal #6

Paper review list

Paper review candidates - [ ] [Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments](http://papers.nips.cc/paper/7217-multi-agent-actor-critic-for-mixed-cooperative-competitive-environments) -…

jahyun-dev updated 6 years ago
3
kurtzace/diary-2024 #14

AWS deep racer

[video](https://www.youtube.com/watch?v=HmD2x_WYfh8&t=581s) on the basics Car provided and its features ![image](https://github.com/user-attachments/assets/b8068b79-b553-4474-b12f-f95b3a2ca71f) O…

kurtzace updated 1 week ago
11
Kaixhin/Rainbow #23

Future improvements

First, hands down, amazing work. Serving as a baseline, I see a possible improvement, if someone wants to implement it: - The n-step return, as it is, is biased (as you are using old off-policy sam…

jaromiru updated 3 years ago
4
CoffeeKumazaki/arXiv #2928

A Safe Hierarchical Planning Framework for Complex Driving S…

A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning. (arXiv:2101.06778v1 [cs.RO]) https://ift.tt/3sEGmMH Autonomous vehicles need to handle various tra…

CoffeeKumazaki updated 3 years ago
2
yingchengyang/CPPO #2

can you provide the correlative paper about the cppo realise…

if the Mathematical proof in the paper Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk can support the code of cppo in this project? I can not understand the variable cv…

BigCakeLove updated 5 months ago
1
modumarl/proposal #5

20180417 meeting minutes

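MultiAgent RL ## Problem setup - Cooperation => chase - The chasers use MARL - The runners are rule-based - combat? A fighting algorithm? - Evaluation? Rule-based vs MARL Catching agents is easier Soccer is not a suitable scenario. Passing at most... ## Training methodology - centralized - de…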

jahyun-dev updated 6 years ago
2
liuzuxin/OSRL #25

The training output is empty

Hi,bro, Thank you so much for your contributions to offline safe reinforcement learning. Firstly, I close the wandb logger. And when I run the code such as train_cdt by `python .\examples\train\…

HenryZhang-git updated 6 days ago
9

