Open flash-freezing-lava opened 1 month ago
any update on this ?
It seems like we're also blocked from using certain metrics, including the episode_reward_mean inside PB2 and PB, anyone having the same issue?
@sven1977 Any ideas what changed in the logged metrics here?
What happened + What you expected to happen
For the below script using Tune with RLlib, the output changed (from version
~=2.9.0
), and now contains irrelevant columns likenum_healthy_workers
instead of the default coumns likeepiside_reward_mean
.Expected output was like:
The cause seems to be that
episode_reward_mean
is now returned asenv_runners/episode_reward_mean
only, so it is not found when checkingDEFAULT_COLUMNS
and therefore not in the output_infer_user_metrics
guesses the irrelevant metricsVersions / Dependencies
Ray:
2.23.0
Python:3.11.9
OS: Arch LinuxOutput of
pip list
:Reproduction script
Issue Severity
Low: It annoys or frustrates me.