Closed cirrostratus1 closed 2 years ago
Hi, I'm a bot from the Ray team :)
To help human contributors to focus on more relevant issues, I will automatically add the stale label to issues that have had no activity for more than 4 months.
If there is no further activity in the next 14 days, the issue will be closed!
You can always ask for help on our discussion forum or Ray's public slack channel.
Hi again! The issue will be closed because there has been no more activity in the 14 days since the last message.
Please feel free to reopen or open a new issue if you'd still like it to be addressed.
Again, you can always ask for help on our discussion forum or Ray's public slack channel.
Thanks again for opening the issue!
Search before asking
Ray Component
RLlib
What happened + What you expected to happen
Running the AlphaZero algorithm with video generation enabled during evaluation crashes the process, because the observation is not contained in the observation space.
The problem is that the observation only contains the value of the `obs` key, but should be a dict containing the observation and action mask. The error only occurs during evaluation, which is run on the local worker.
Expected behavior: The script should run without errors and produce videos in the output folder.
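To make the mismatch concrete, here is a minimal standalone sketch of the check that fails. It does not use RLlib or Gym. The helper names `missing_keys` and `wrap_observation` are hypothetical, and the key name `action_mask` is an assumption based on the description above (only `obs` is named explicitly in this report).

```python
from collections.abc import Mapping
from typing import Any, List, Sequence

# Assumed key names: "obs" is taken from the report, "action_mask" is a guess
# at the name of the action-mask entry in the dict observation space.
REQUIRED_KEYS = ("obs", "action_mask")


def missing_keys(observation: Any) -> List[str]:
    """Return the required keys that are absent from `observation`.

    A bare (non-dict) observation, such as the raw state vector seen during
    evaluation, is reported as missing every required key.
    """
    if not isinstance(observation, Mapping):
        return list(REQUIRED_KEYS)
    return [key for key in REQUIRED_KEYS if key not in observation]


def wrap_observation(raw_obs: Sequence[float], num_actions: int) -> dict:
    """Hypothetical helper: build the dict form, with every action allowed."""
    return {"obs": list(raw_obs), "action_mask": [1] * num_actions}


raw = [0.01, -0.02, 0.03, 0.04]  # what the evaluation path appears to pass on
print(missing_keys(raw))                       # bare vector: both keys missing
print(missing_keys(wrap_observation(raw, 2)))  # dict form: nothing missing
```

A check like this, placed where the evaluation worker receives observations, could help confirm whether the dict is being unpacked somewhere before the observation-space validation runs.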
The environment used is a modified version of the original CartPole environment for AlphaZero, with code for video creation added and bug #19861 (regarding the dtype of the observation) fixed.
Versions / Dependencies
Ray 1.8.0, Python 3.7, Ubuntu 18.04 LTS, ffmpeg
Reproduction script
Anything else
Occurs every time.
Are you willing to submit a PR?