ppo-pytorch Search Results

1000+ results
for ppo-pytorch

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Pillars-Creation/ChatGLM-RLHF-LoRA-RM-PPO #2

[BUG/Help] 请问作者是在单卡A100 40G显存条件下跑通全部流程的吗？包括后续的PPO阶段（需要同时塞两个模…

### Is there an existing issue for this? - [X] I have searched the existing issues ### Current Behavior 请问作者是在单卡A100 40G显存条件下跑通全部流程的吗？包括后续的PPO阶段（需要同时塞两个模型） ### Expected Behavior _No response_ ##…

BIT-Xu updated 9 months ago
4
duckietown/gym-duckietown #278

Can't find pytorch_rl.

When I run python3 pytorch_rl/main.py --no-vis --env-name Duckietown-small_loop-v0 --algo a2c --lr 0.0002 --max-grad-norm 0.5 --num-steps 20,there is an erro that tell me does't has the directory pyto…

eisbzuwnaj updated 1 year ago
1
rlworkgroup/garage #1181

Add RNN support to torch/PPO and torch/TRPO

ryanjulian updated 3 years ago
3
huggingface/trl #1953

Error in PPOv2

I met the following error in PPOv2. Would you mind providing me some hints on why that happens? Traceback (most recent call last): Traceback (most recent call last): File "/home/ec2-user/SageMa…

ZhichaoWang970201 updated 2 weeks ago
1
JDBumgardner/stone_ground_hearth_battles #45

Bot playing cards while dead

It appears that a bot (not sure which one) somehow played a Vulgar Homunculus when it was already dead, resulting in the following exception. Traceback (most recent call last): File "/Users/etha…

ethansaxenian updated 3 years ago
1
ikostrikov/pytorch-a2c-ppo-acktr-gail #206

Recurrent states not reset between episode boundaries

The policy is given the last recurrent state from the replay buffer and isn't reset between episode boundaries. In my case I have the number of updates set to the episode length, so I've added `rollou…

bamos updated 4 years ago
3
openai/spinningup #308

Invalid MIT-MAGIC-COOKIE-1 key

I am getting a strange comment I had not seen before when running any spinup.run. i.e.: ```` python3 -m spinup.run ppo --hid "[32,32]" --env Walker2d-v2 --exp_name mujocotest ```` Then, immediat…

rojas70 updated 2 years ago
2
ray-project/ray #25001

[Core][RLlib][Tune] CUDA PTX error when training with Tune

### What happened + What you expected to happen ## 1 Training a PyTorch-based policy with Tune inside a container results in an error: `CUDA error: the provided PTX was compiled with an uns…

jdchn updated 1 year ago
1
kandouss/kamarl #4

Risk of division by zero

Hi! I just spotted a potential flaw, that might cause divisions by zero. [https://github.com/kandouss/kamarl/blob/master/kamarl/ppo.py#L485](https://github.com/kandouss/kamarl/blob/master/kamarl/p…

MarcoMeter updated 2 years ago
1
uvipen/Super-mario-bros-PPO-pytorch #18

size issue on GAE process

While study your Mario PPO codes, https://github.com/uvipen/Super-mario-bros-PPO-pytorch/blob/master/train.py, it’s hard to understand the following codes: #########################################…

davincibj updated 10 months ago
3

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for ppo-pytorch

1000+ results
for ppo-pytorch