@Rohan138 This seems to be relevant to the policy_v1 vs v2 migration that we saw the other day. I'll leave you assigned for now. For some reason, we don't test compute_log_likelihood methods at all.
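For reference, a sanity check of the kind that is currently missing could look roughly like the sketch below. It is plain Python rather than RLlib code. It assumes the SimpleQ action distribution is a categorical over the Q-values used as logits. The helper names `categorical_log_likelihoods` and `check_log_likelihoods` are hypothetical, not part of Ray. A real test would feed in the values returned by the policy instead of hand-written logits.

```python
import math


def categorical_log_likelihoods(logits, actions):
    """Return log softmax(logits)[action] for each (row, action) pair.

    Hypothetical reference implementation, not RLlib's own code.
    """
    results = []
    for row, action in zip(logits, actions):
        # Numerically stable log-sum-exp over the row.
        m = max(row)
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        results.append(row[action] - log_z)
    return results


def check_log_likelihoods(logits):
    """Check that each row's action probabilities are valid and sum to 1."""
    n_actions = len(logits[0])
    totals = [0.0] * len(logits)
    for a in range(n_actions):
        logps = categorical_log_likelihoods(logits, [a] * len(logits))
        for i, lp in enumerate(logps):
            if lp > 0.0:
                # A log-probability can never be positive.
                return False
            totals[i] += math.exp(lp)
    return all(math.isclose(t, 1.0, rel_tol=1e-9) for t in totals)


if __name__ == "__main__":
    # Hand-written Q-values standing in for a policy's distribution inputs.
    q_values = [[1.0, 2.0], [0.5, -0.5]]
    print(check_log_likelihoods(q_values))
```

Running the same comparison against the TF1, TF2, and Torch policies with identical inputs would show whether only the TF2 path diverges.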
What happened + What you expected to happen
Something's broken with the SimpleQ TF2 action distribution, but I can't track down the bug. This doesn't happen with TF1/Torch.
Versions / Dependencies
ray 3.0.0dev0 (master)
Ubuntu 20.04
tensorflow 2.7.0 (pypi)
Reproduction script
Issue Severity
Medium: It is a significant difficulty but I can work around it.