-
Hi @bmazoure,
Your PPO + JAX implementation caught my eye, and this is a really cool repo!
Based on your [benchmark](https://wandb.ai/bmazoure/ppo_procgen_jax/reports/PPO-Procgen-JAX-version---V…
-
Hi, great environment! Just wondering, is there a PPO baseline available for this environment?
-
## Problem Description
Running `ppo_atari.py` raises an error, while `ppo.py` runs correctly.
Error: `gymnasium.error.NameNotFound`: Environment `BreakoutNoFrameskip` doesn't exist.
…
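Not sure of your setup, but one common cause of this `NameNotFound` is that Gymnasium's Atari environments are not registered until the ALE extras are installed. A hedged guess at a fix (package names taken from the usual Gymnasium setup, not from this repo):

```shell
# install Gymnasium's Atari extras so BreakoutNoFrameskip-v4 gets registered
pip install "gymnasium[atari]" "gymnasium[accept-rom-license]"
```

If that does not help, it may be worth checking that the env id used in `ppo_atari.py` matches the installed gymnasium/ale-py versions.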
-
Hi! This is nice work, and it's simple but effective. I was wondering if you could open-source the PPO baseline model as well. I hope to reproduce the results from Table 3 in the paper. It would be v…
-
Hi,
The current PPO implementation does not seem to account for time limits. While the `EpisodeWrapper` from brax is used, which tracks a truncation flag ([source](https://github.com/google/brax/bl…
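To illustrate the concern, here is a minimal NumPy sketch (not this repo's code) of GAE that distinguishes true terminals from time-limit truncations. It assumes `next_values[t]` is the critic's estimate for the state actually reached at step t (at a truncation, the pre-reset state rather than the reset state); the function name and signature are hypothetical:

```python
import numpy as np

def gae_with_truncation(rewards, values, next_values, terminals, truncations,
                        gamma=0.99, lam=0.95):
    """GAE that bootstraps through time-limit truncations.

    All inputs are arrays of length T. A true terminal zeroes the bootstrap;
    a truncation keeps it, because the episode did not really end there.
    """
    T = len(rewards)
    adv = np.zeros(T)
    gae = 0.0
    for t in reversed(range(T)):
        real_end = terminals[t] and not truncations[t]
        bootstrap = 0.0 if real_end else next_values[t]
        delta = rewards[t] + gamma * bootstrap - values[t]
        # GAE accumulation stops at any episode boundary, terminal or truncated
        boundary = terminals[t] or truncations[t]
        gae = delta + gamma * lam * (0.0 if boundary else 1.0) * gae
        adv[t] = gae
    return adv
```

With this convention a truncated step still receives `gamma * V(s_next)` in its TD error, while a genuine terminal does not, which is exactly the distinction the truncation flag is meant to carry.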
-
Hi, thank you very much for providing the code. When I ran `ppo_training_mh`, I encountered the error below.
The package versions I use are the same as yours, except for pytorch-geometric (2.5.3).
…
-
**Machine: MAX1100**
**ipex-llm: 2.1.0b20240421**
**bigdl-core-xe-21: 2.5.0b20240421**
**bigdl-core-xe-esimd-21: 2.5.0b20240421**
[Related PR](https://github.com/intel-analytics/ipex-llm…
-
Hi,
First of all, thanks for interpax!
I see that interpax depends explicitly on numpy
-
Hi, this is a great project, thank you for sharing it. I ported the code to ROS 2 Humble and it works. Now I have changed the algorithm to PPO, but it is not working. Can you give me some tips and tricks to implem…
-
## What
Add Curiosity driven exploration to PPO.
## Why
Curiosity has been shown [citation needed] to improve agents' performance in sparse-reward environments.
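One way to sketch the idea (ICM-style; all names here are hypothetical, and the forward model is a stand-in that ignores the action for brevity):

```python
import numpy as np

def intrinsic_reward(phi_s, phi_next, forward_model, eta=0.5):
    """Curiosity bonus as forward-model prediction error.

    phi_s, phi_next: feature embeddings of s_t and s_{t+1}.
    forward_model: predicts phi_next from phi_s (a real ICM forward model
    would also take the action). The bonus is large where the model predicts
    poorly, i.e. in novel states.
    """
    pred = forward_model(phi_s)
    return eta * 0.5 * np.sum((pred - phi_next) ** 2)

# toy stand-in: an untrained "forward model" that just echoes its input
forward = lambda phi: phi
novel = intrinsic_reward(np.zeros(4), np.ones(4), forward)    # poorly predicted
familiar = intrinsic_reward(np.ones(4), np.ones(4), forward)  # perfectly predicted
```

In the PPO rollout, `r_total = r_extrinsic + intrinsic_reward(...)` would feed into the advantage computation, with the forward model trained alongside the policy so the bonus decays as states become familiar.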