proximal-policy-optimization Search Results

166 results
for proximal-policy-optimization


521xueweihan/HelloGitHub #2558

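[Open-Source Self-Recommendation] PPO × Family DRL Tutorial Course | Introductory Open Course on Decision Intelligence

## PPO × Family DRL Tutorial Course | Introductory Open Course on Decision Intelligence - Project URL: https://github.com/opendilab/PPOxFamily - Category: Machine Learning - Project title: PPO × Family DRL Tutorial Course | Introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, untangle the code logic, and master decision AI applications in practice …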

VaninaY updated 1 year ago
1
Ablesh1/ml_project_2023 #4

Do research

1. Selection of ML algorithms to be used in the project 2. Literature review of ML algorithms 3. Code review of ML algorithms https://learningtoplaydotnet.files.wordpress.com/2020/08/ptl4.pdf [found…

Ablesh1 updated 1 year ago
2
tinkoff-ai/CORL #25

The results about td3_bc on Antmaze

Hi, may I ask about the settings for td3_bc on AntMaze? I find that the current hyperparameters do not work well and do not reproduce results similar to those in the paper. Best

lucasliunju updated 1 year ago
8
anowell/are-we-learning-yet #118

Add crate: dfdx

Please add crate to category: Neural Networks If you're open to multiple categories, I think dfdx could also be added to Reinforcement Learning (I have some examples of a Deep Q Network and Proxima…

coreylowman updated 1 year ago
1
DLR-RM/stable-baselines3 #1502

[Bug]: Inconsistent results with trained agent in OpenAI gym…

### 🐛 Bug I'm using the Proximal Policy Optimization (PPO) algorithm to train an agent in an OpenAI gym environment for trading. After training the agent and saving it, I reload it and run simulati…

chrishsr updated 1 year ago
2
Lightning-AI/pytorch-lightning #10239

Call for High Quality Lightning Lectures - Community Sprint

## 🚀 Feature Dear community, As PyTorch Lightning matures, we believe it is important for the Lightning Team and its community to improve the Lightning onboarding process. In that regard…

tchaton updated 1 year ago
16
siemens/powergym #4

Proximal Policy Optimization (PPO) and Soft Actor-Critic (SA…

I have just gone through the paper "PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems". I have found it insightful and thanks for sharing this reposito…

sahasubhajit updated 1 year ago
3
huggingface/trl #151

Distributed training stuck

I'm using the gpt2-sentiment.py script in examples for distributed training, where the data and reward models are replaced with our own. If there is no problem with using a GPU for training, it can…

koking0 updated 1 year ago
6
africamonkey/autopilot-cross-intersection #1

Reference papers

Hello Thank you for sharing your materials! And I am very happy with your modified Flow. In the past, I tried to install Flow from the official repo but it always had errors. With your repo, it is e…

TrinhTuanHung2021 updated 1 year ago
2
junhuihuang/webpages #3

Stanford MLSys Seminar Series |

I will progressively summarize talks I find illuminating from the [Stanford MLSys](https://mlsys.stanford.edu/) Seminar Series here. Talk Link: [https://www.youtube.com/watch?v=DB7oOZ5hyrE](https://w…

junhuihuang updated 1 year ago
2
