Hi, I would like to ask that whether you are considering adding test cases on the hybrid env with respect to the reward value? For example, test whether the reward value is correct in some special states (like moving towards the goal, or moving away from the goal or terminal states). I think it would greatly improve the robustness of the environment. 👍
Hi, I would like to ask that whether you are considering adding test cases on the hybrid env with respect to the reward value? For example, test whether the reward value is correct in some special states (like moving towards the goal, or moving away from the goal or terminal states). I think it would greatly improve the robustness of the environment. 👍