Motivation
Stable-baselines3 (SB3) has introduced support for action masking (see here), which is a great feature. However, this API requires the user to wrap the environment in an ActionMasker wrapper. The issue is that some environments (e.g., gym-microrts, pettingzoo) directly provide a vectorized interface, so there is no single, non-vectorized environment to which ActionMasker can be applied.
Feature
Extend MaskablePPO to work with vectorized environments natively. With this extension, SB3's PPO with action masking could be used with gym-microrts and pettingzoo.
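To make the request concrete, here is a minimal sketch of what "native" masking could look like. It is not SB3 code: `ToyVecEnv`, its batched `action_masks()` method, and `batched_masked_probs` are hypothetical names chosen for illustration, and the assumption is that a vectorized environment can return one boolean mask row per sub-environment.

```python
import math
from typing import List


class ToyVecEnv:
    """Hypothetical natively vectorized environment (not a real SB3 or gym-microrts class)."""

    def __init__(self, num_envs: int, num_actions: int) -> None:
        self.num_envs = num_envs
        self.num_actions = num_actions

    def action_masks(self) -> List[List[bool]]:
        # Assumed convention: one row per sub-environment, True = action is valid.
        # For illustration, action (i mod num_actions) is invalid in sub-environment i.
        return [
            [a != (i % self.num_actions) for a in range(self.num_actions)]
            for i in range(self.num_envs)
        ]


def masked_softmax(logits: List[float], mask: List[bool]) -> List[float]:
    """Softmax over valid actions only; invalid actions get probability 0."""
    valid = [l for l, m in zip(logits, mask) if m]
    if not valid:
        raise ValueError("at least one action must be valid")
    max_logit = max(valid)  # subtract the max for numerical stability
    exps = [math.exp(l - max_logit) if m else 0.0 for l, m in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def batched_masked_probs(env: ToyVecEnv, batch_logits: List[List[float]]) -> List[List[float]]:
    """Apply each sub-environment's mask to its own row of policy logits."""
    masks = env.action_masks()
    return [masked_softmax(l, m) for l, m in zip(batch_logits, masks)]


env = ToyVecEnv(num_envs=2, num_actions=3)
probs = batched_masked_probs(env, [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]])
```

The point of the sketch is that the masks come from the vectorized environment in one call, so no per-environment wrapper is needed before vectorization.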