-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/OmniSafeAI/omnisafe/issues) and [Discussions](https://github.com/OmniSafeAI/omnisafe/discussions) that this has…
-
- https://openai.com/blog/openai-baselines-ppo/
- https://medium.com/intro-to-artificial-intelligence/proximal-policy-optimization-ppo-a-policy-based-reinforcement-learning-algorithm-3cf126a7562d
- …
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/omnisafe/issues) and [Discussions](https://github.com/PKU-A…
-
Dear all,
While the book currently has a small section on Reinforcement Learning covering MDPs, value iteration, and the Q-Learning algorithm, it still does not cover an important family of a…
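For context on the family of methods the request refers to, here is a hedged sketch of a minimal policy-gradient (REINFORCE-style) learner on a toy two-armed bandit. The environment, payoffs, and hyperparameters are all illustrative, not from the book; the point is only that, unlike value iteration or Q-Learning, the policy itself is parameterized and updated along the gradient of expected return.

```python
import numpy as np

rng = np.random.default_rng(0)
logits = np.zeros(2)               # policy parameters, one per action
true_means = np.array([0.2, 0.8])  # illustrative bandit payoffs (assumed)
alpha = 0.1                        # learning rate

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

for t in range(2000):
    probs = softmax(logits)
    a = rng.choice(2, p=probs)           # sample an action from the policy
    r = rng.normal(true_means[a], 0.1)   # sample a reward for that action
    # REINFORCE update: move logits along grad log pi(a), scaled by reward
    grad_log_pi = -probs
    grad_log_pi[a] += 1.0
    logits += alpha * r * grad_log_pi
```

After enough samples the policy concentrates on the higher-paying arm, which is the core behavior the later, more elaborate algorithms (actor-critic, PPO) build on.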
-
### Student
- Nikola Simić RA 32/2020
### Assistant
- Filip Volarić
### Problem being solved
- The agent's goal is to position itself in a parking spot in the shortest possible time. On the way t…
-
# Actor-Critic Algorithms #
- Authors: Vijay R. Konda, John N. Tsitsiklis
- Origin: https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf
- Related:
- PyTorch4 tutorial of: actor critic…
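As a rough illustration of the idea in the Konda & Tsitsiklis paper linked above, here is a minimal one-step (TD) actor-critic sketch in a tabular setting. The toy environment, step sizes, and episode count are assumptions for illustration, not the paper's setup: the critic learns state values with TD(0), and the actor follows the policy gradient weighted by the TD error.

```python
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions = 4, 2
theta = np.zeros((n_states, n_actions))  # actor: per-state policy logits
V = np.zeros(n_states)                   # critic: state-value estimates
alpha_actor, alpha_critic, gamma = 0.1, 0.2, 0.99

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

def step(s, a):
    """Toy chain environment: action 1 moves right; reward 1 at the last state."""
    s_next = min(s + a, n_states - 1)
    r = 1.0 if s_next == n_states - 1 else 0.0
    return s_next, r, s_next == n_states - 1

for episode in range(200):
    s, done = 0, False
    while not done:
        probs = softmax(theta[s])
        a = rng.choice(n_actions, p=probs)
        s_next, r, done = step(s, a)
        # TD error doubles as an advantage estimate for the actor update
        td_error = r + (0.0 if done else gamma * V[s_next]) - V[s]
        V[s] += alpha_critic * td_error
        grad_log_pi = -probs
        grad_log_pi[a] += 1.0
        theta[s] += alpha_actor * td_error * grad_log_pi
        s = s_next
```

After training, the policy at every state should prefer moving right, since the critic's TD errors propagate the terminal reward back along the chain.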
-
-
Hi alifanov, thanks for sharing your code; it's a very good example.
I am trying to train simple-EC, but it feels very slow. Shouldn't synchronous training of EC be able to take advantage of more CPU cores?
T…
-
Due to the high computing power required for training, we will gradually upload data to the data hub and report the progress in this issue. We will also change the priority of training according to ne…
-
Hello, I noticed that the code controls the learning rate by comparing the KL divergence between the outputs of the old and new neural networks. In my experiments the learning rate first increased quickly and then gradually decreased, which suggests the method really works. Is there any literature describing this approach, or did you devise it from experience?
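For readers unfamiliar with the mechanism being asked about, here is a hedged sketch of KL-adaptive learning-rate control. This is not the author's exact code; it follows the same idea as PPO's adaptive-KL variant: after each update, measure the KL divergence between the old and new policy output distributions, then shrink the learning rate if the policy moved too far and grow it if it barely moved. The thresholds and scaling factor are illustrative assumptions.

```python
import numpy as np

def kl_divergence(p, q, eps=1e-8):
    """Mean KL(p || q) over batches of discrete action distributions."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.mean(np.sum(p * np.log(p / q), axis=-1)))

def adapt_lr(lr, kl, target_kl=0.01, factor=1.5, lr_min=1e-6, lr_max=1e-2):
    """Shrink lr when the update was too aggressive, grow it when too timid."""
    if kl > target_kl * 2.0:      # policy moved too far: back off
        lr = max(lr / factor, lr_min)
    elif kl < target_kl / 2.0:    # policy barely moved: speed up
        lr = min(lr * factor, lr_max)
    return lr

# Example: the old and new policies differ noticeably, so lr is reduced
old_probs = np.array([[0.5, 0.5]])
new_probs = np.array([[0.6, 0.4]])
lr = adapt_lr(1e-3, kl_divergence(old_probs, new_probs))
```

Early in training the policy changes little per step (small KL), so the rate ramps up; later, larger policy shifts push it back down, matching the increase-then-decrease pattern described in the question.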