icheered closed this issue 5 months ago
Thanks to https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/178 I found `from sb3_contrib.common.maskable.callbacks import MaskableEvalCallback`. Replacing my `EvalCallback` with `MaskableEvalCallback` fixed my issue. However this was nowhere in the docs, perhaps this should be added.
> However this was nowhere in the docs, perhaps this should be added.
Yes, it is only shown in the example. I would appreciate a PR that updates the docs and adds a note/warning about that ;)
🐛 Bug
I'm still investigating, but `EvalCallback` seems to crash MaskablePPO. Whenever I add `EvalCallback(eval_env=env, eval_freq=10)` as a callback to `model.learn` it will work until the 10th iteration, after which the environment's `action_masks` is no longer called.
Code example
Relevant log output / Error message
System Info
Checklist