Open juhannc opened 1 year ago
Hello, I wanted to use MBMPO and I couldn't get the example running. It would be great if you can help!
Logs: mbmpo_log.txt
Versions / Dependencies
Ray: 2.1.0
Python: 3.9.13
OS: Ubuntu 22.04.2
Reproduction script:
from ray.tune import register_env
from ray.rllib.algorithms.ppo import PPOConfig
from ray.rllib.algorithms.sac import SACConfig
from ray.rllib.algorithms.mbmpo import MBMPOConfig
from ray.rllib.examples.env.mbmpo_env import CartPoleWrapper
from ray.rllib.examples.env.mbmpo_env import PendulumWrapper
from ray.rllib.evaluation.rollout_worker import RolloutWorker
import os
import time
import wandb
import numpy as np
import tracemalloc
import tensorboard

if __name__ == "__main__":
    # Build an MBMPO algorithm on the Pendulum example environment.
    agent = (
        MBMPOConfig()
        .environment(env=PendulumWrapper, disable_env_checking=True)
        .rollouts(num_rollout_workers=10, num_envs_per_worker=20)
        .training(
            inner_adaptation_steps=1,
            maml_optimizer_steps=8,
            gamma=0.99,
            lambda_=1,
            lr=0.001,
            vf_clip_param=0.5,
            kl_target=0.003,
            kl_coeff=0.0000000001,
            inner_lr=0.001,
            num_maml_steps=15,
            model={"fcnet_hiddens": [32, 32], "free_log_std": True},
        )
        .framework("torch")
        .build()
    )
    # Run a single training iteration and print the result dict.
    result = agent.train()
    print(result)
Same issue for me on Ray 2.3.1. Have you solved it?
What happened + What you expected to happen
When using the MBMPO algorithm, the training stops after the first training of the dynamics ensemble. Even hours later, nothing happens.
What I expect is for training to continue.
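For anyone trying to reproduce the hang, a minimal sketch of one way to see where the driver process is stuck, using only Python's standard `faulthandler` module. The helper name `run_with_watchdog` and the 600-second timeout are illustrative assumptions, not part of Ray or of the original script. Note that this only dumps the stacks of the driver process. Ray rollout workers run in separate processes and are not covered.

```python
import faulthandler
import sys


def run_with_watchdog(fn, timeout_s=600.0, file=None):
    """Call fn(); while it runs, dump all thread stacks every timeout_s seconds.

    `run_with_watchdog` is a hypothetical helper for this issue, not a Ray API.
    The dump covers only the current (driver) process.
    """
    if file is None:
        file = sys.stderr
    # Schedule a repeating traceback dump; exit=False keeps the process alive.
    faulthandler.dump_traceback_later(timeout_s, repeat=True, file=file, exit=False)
    try:
        return fn()
    finally:
        # Stop the watchdog once fn() has returned or raised.
        faulthandler.cancel_dump_traceback_later()


# Hypothetical usage with an already-built algorithm object named `agent`:
#   result = run_with_watchdog(agent.train, timeout_s=600)
```

If `train()` never returns, the stack traces written to stderr every `timeout_s` seconds should at least show which call the driver is blocked in.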
Logs:
Versions / Dependencies
Ray: 3.0.0dev0 (commit: 8b55e2d85301ffae02bd980b9e242e5671bf104c)
Python: 3.8.10
OS: Ubuntu 20.04.5
Reproduction script
Issue Severity
High: It blocks me from completing my task.