-
### 🐛 Bug
While running Recurrent PPO on CartPole in a background notebook on Kaggle, the task crashed after 6 hours, before finishing.
### To Reproduce
It was a simple test on the CartPole environment. Here th…
-
Hi Everyone,
I've been using ppo-tfjs for the last month and find it to be an incredible library, thank you so much for making it! I've been working on a fork over at https://github.com/alistairhea…
-
## A question
- I want to build a small scale version of **Open AI Five**
- And I learnt that it uses LSTM + PPO
- Suppose I build a network model using LSTM; should I then use this network for …
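For what it's worth, the usual LSTM+PPO setup (as reported for OpenAI Five) shares one recurrent trunk between the policy head and the value head, carrying the `(h, c)` state through each rollout and resetting it at episode boundaries. A minimal NumPy sketch of that structure — the class name, layer sizes, and linear heads here are all illustrative placeholders, not anyone's actual architecture:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class LSTMPolicy:
    """Tiny recurrent actor-critic: one LSTM cell shared by a
    policy head (action logits) and a value head."""

    def __init__(self, obs_dim, hidden, n_actions, seed=0):
        rng = np.random.default_rng(seed)
        s = 0.1
        self.Wx = rng.normal(0, s, (obs_dim, 4 * hidden))  # input -> gates
        self.Wh = rng.normal(0, s, (hidden, 4 * hidden))   # hidden -> gates
        self.b = np.zeros(4 * hidden)
        self.Wpi = rng.normal(0, s, (hidden, n_actions))   # policy head
        self.Wv = rng.normal(0, s, (hidden, 1))            # value head
        self.hidden = hidden

    def step(self, obs, state):
        h, c = state
        z = obs @ self.Wx + h @ self.Wh + self.b
        H = self.hidden
        # Standard LSTM gates: input, forget, candidate, output.
        i, f = sigmoid(z[:H]), sigmoid(z[H:2 * H])
        g, o = np.tanh(z[2 * H:3 * H]), sigmoid(z[3 * H:])
        c = f * c + i * g
        h = o * np.tanh(c)
        logits = h @ self.Wpi      # used to sample the action
        value = float(h @ self.Wv)  # used as the PPO baseline
        return logits, value, (h, c)

policy = LSTMPolicy(obs_dim=4, hidden=8, n_actions=2)
state = (np.zeros(8), np.zeros(8))  # carried across timesteps, reset per episode
logits, value, state = policy.step(np.ones(4), state)
```

So yes — in this pattern the same recurrent network is queried both when acting and when estimating values; only the two small output heads differ.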
-
Hi, great environment! Just wondering: is there a PPO baseline available for this environment?
-
When running the PPO baseline on my M1 Mac using the command `python ppo.py --save_policy`, I encounter `ValueError: Unrecognized name format` during the policy-saving process within the _save_network…
-
Hi @bmazoure,
Your PPO + JAX implementation caught my eye; this is a really cool repo!
Based on your [benchmark](https://wandb.ai/bmazoure/ppo_procgen_jax/reports/PPO-Procgen-JAX-version---V…
-
Release test **rllib_learning_tests_pong_ppo_torch.aws** failed. See https://buildkite.com/ray-project/release/builds/16725#018fe1f2-a6ac-4002-b08b-6d5c34f87e40 for more details.
Managed by OSS Test …
-
How is `low_level_load_path` in train.py and config_ppo.yaml generated?
evaluate.py sets `lower_model` and `upper_model`.
I get the error `Encoder type cnn not supported!`
I tried all 4 `upper_model` checkpoints; after loading, the `encoder_type` is `cnn` rather than `pixel`.
Is there any more detailed documentation on training or evaluation?
-
I was running the script from step 3: `python3 train.py --step 3 --deployment-type single_gpu`
The training.log shows this:
A decoder-only architecture is being used, but right-padding was detected! …
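That warning appears because decoder-only models generate from the last position of each sequence, so batched prompts should be left-padded; with right padding, generation starts from pad tokens. If the script uses a Hugging Face tokenizer, the usual fix is setting `tokenizer.padding_side = "left"` (I'm assuming that's what this script uses). A tiny illustration of the difference — `pad_batch` is a hypothetical helper, not from the repo:

```python
PAD = 0  # placeholder pad token id

def pad_batch(seqs, side="left"):
    """Pad variable-length token-id lists to equal length (illustrative helper)."""
    n = max(len(s) for s in seqs)
    out = []
    for s in seqs:
        pads = [PAD] * (n - len(s))
        out.append(pads + s if side == "left" else s + pads)
    return out

batch = [[5, 6, 7], [8, 9]]
left = pad_batch(batch, side="left")    # [[5, 6, 7], [0, 8, 9]]
right = pad_batch(batch, side="right")  # [[5, 6, 7], [8, 9, 0]]
# With left padding, the final position of every row is a real token,
# which is what a decoder-only model conditions on for the next step.
```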
-
## What
Add curiosity-driven exploration to PPO.
## Why
It's been shown [citation needed] that curiosity improves agents' performance in sparse-reward environments.
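One common formulation is ICM-style forward-model curiosity (Pathak et al.): a learned model predicts the next state features from the current state and action, and its prediction error becomes an intrinsic reward added to the environment reward before PPO computes advantages. A minimal sketch — the linear forward model, dimensions, and `eta` scale are placeholders for illustration, not a proposed implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

obs_dim, n_actions = 4, 2
# Forward-model weights mapping (state, one-hot action) -> predicted next state.
W = rng.normal(0, 0.1, (obs_dim + n_actions, obs_dim))

def intrinsic_reward(s, a, s_next, eta=0.5):
    """Curiosity bonus = scaled prediction error of the forward model.
    In a real ICM, W is trained online and states are encoded features."""
    a_onehot = np.eye(n_actions)[a]
    pred = np.concatenate([s, a_onehot]) @ W
    return eta * float(np.sum((pred - s_next) ** 2))

# In the PPO rollout, the total reward would be:
#   r_total = r_extrinsic + intrinsic_reward(s, a, s_next)
r_int = intrinsic_reward(np.ones(obs_dim), 1, np.ones(obs_dim))
```

As the forward model improves on familiar transitions, the bonus there shrinks, so the agent is pushed toward transitions it cannot yet predict — which is exactly what helps when extrinsic rewards are sparse.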