proximal-policy-optimization Search Results

173 results
for proximal-policy-optimization


tinkoff-ai/CORL #25

The results about td3_bc on Antmaze

Hi, may I ask about the settings for td3_bc on AntMaze? I find that the current hyperparameters do not work well and do not reproduce results similar to those in the paper. Best

lucasliunju updated 1 year ago
8
anowell/are-we-learning-yet #118

Add crate: dfdx

Please add crate to category: Neural Networks. If you're open to multiple categories, I think dfdx could also be added to Reinforcement Learning (I have some examples of a Deep Q Network and Proxima…

coreylowman updated 1 year ago
1
Lightning-AI/pytorch-lightning #10239

Call for High Quality Lightning Lectures - Community Sprint

## 🚀 Feature Dear community, As PyTorch Lightning matures, we believe it is important for the Lightning Team and its community to improve the Lightning onboarding process. In that regard…

tchaton updated 1 year ago
16
siemens/powergym #4

Proximal Policy Optimization (PPO) and Soft Actor-Critic (SA…

I have just gone through the paper "PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems". I have found it insightful and thanks for sharing this reposito…

sahasubhajit updated 2 years ago
3
huggingface/trl #151

Distributed training stuck

I'm using the gpt2-sentiment.py script in examples for distributed training, where the data and reward models are replaced with our own. If there is no problem with using a GPU for training, it can…

koking0 updated 1 year ago
6
junhuihuang/webpages #3

Stanford MLSys Seminar Series

I will progressively summarize talks I find illuminating from the [Stanford MLSys](https://mlsys.stanford.edu/) Seminar Series here. Talk Link: [https://www.youtube.com/watch?v=DB7oOZ5hyrE](https://w…

junhuihuang updated 1 year ago
2
huggingface/trl #172

Stuck in Distributed Training with gpt2-sentiment.py

Update: Seems that I got stuck at `stats_to_np`. Hi, I encountered the same problem, where I got stuck at `gather_stats`. But I am using the official script as shown below. Could you help me take …

shizhediao updated 1 year ago
3
africamonkey/autopilot-cross-intersection #1

Reference papers

Hello Thank you for sharing your materials! And I am very happy with your modified Flow. In the past, I tried to install Flow from the official repo but it always had errors. With your repo, it is e…

TrinhTuanHung2021 updated 2 years ago
2
ixxmu/mp_duty #3153

7 Popular Reinforcement Learning Algorithms and Their Code Implementations!

https://mp.weixin.qq.com/s/DAPirChUTKZ9yLExJw86Tg

ixxmu updated 1 year ago
1
wandb/wandb #4246

[Q] events and code files not uploaded

Hi, I'm new to wandb and am running the code in [ppo-implementation-details](https://github.com/vwxyzjn/ppo-implementation-details) following the video tutorial [Part 1 of 3 — Proximal Policy Optimization…

CarlossShi updated 2 years ago
6
