Open norikazu99 opened 1 year ago
Hi, getting same error for my custom multiagent environment with Multidiscrete action space.
Tried using different schedulers but got stuck with the same issue :
pbt_scheduler = PopulationBasedTraining(
time_attr='training_iteration',
metric="episode_reward_mean",
mode="max",
perturbation_interval=5,
quantile_fraction=0.25,
# Specifies the search space for these hyperparams
hyperparam_mutations={
"lambda": lambda: random.uniform(0.9, 1.0),
"clip_param": lambda: random.uniform(0.1, 0.5),
"lr": lambda: random.uniform(1e-3, 1e-5),
"train_batch_size": lambda: random.randint(1000, 60000),
},
)
pb2_scheduler = PB2(
time_attr='training_iteration',
metric= 'episode_reward_mean',
mode='max',
perturbation_interval=5,
hyperparam_bounds={
"lr": [1e-5, 1e-3],
"gamma": [0.9, 0.999],
},
quantile_fraction= 0.25,
require_attrs=True,
synch=False
)
What happened + What you expected to happen
The following error occurs when using pb2 with custom env of Multidiscrete action space. Custom environment does work when not using pb2.
Versions / Dependencies
Reproduction script
Issue Severity
High: It blocks me from completing my task.