Open alexhernandezgarcia opened 10 months ago
It may be possible to remove get_rewards() altogether: it is probably used only by the flow matching loss, which does not need the zero rewards of the non-terminating states.
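To illustrate the idea, here is a minimal sketch in plain Python. It is not the repository's code: `rewards_with_zeros`, `fm_loss_padded`, `fm_loss_split`, `reward_fn` and the `eps` smoothing term are all hypothetical names and assumptions made for the example. It compares a flow matching loss that goes through a zero-padded reward vector with one that evaluates the reward only on terminating states.

```python
import math


def rewards_with_zeros(states, done, reward_fn):
    # Hypothetical stand-in for get_rewards(): reward for terminating
    # states, 0.0 for non-terminating ones.
    return [reward_fn(s) if d else 0.0 for s, d in zip(states, done)]


def fm_loss_padded(inflow, outflow, states, done, reward_fn, eps=1e-8):
    # Flow matching loss that relies on the zero-padded rewards.
    rewards = rewards_with_zeros(states, done, reward_fn)
    total = 0.0
    for f_in, f_out, r, d in zip(inflow, outflow, rewards, done):
        # Terminating states have no outgoing flow in this sketch.
        target = r + (0.0 if d else f_out)
        total += (math.log(eps + f_in) - math.log(eps + target)) ** 2
    return total / len(states)


def fm_loss_split(inflow, outflow, states, done, reward_fn, eps=1e-8):
    # Same loss, but the reward is only evaluated on terminating states,
    # so no zero-padded reward vector is needed.
    total = 0.0
    for f_in, f_out, s, d in zip(inflow, outflow, states, done):
        target = reward_fn(s) if d else f_out
        total += (math.log(eps + f_in) - math.log(eps + target)) ** 2
    return total / len(states)
```

Under these assumptions both functions return the same value, which suggests the padding step is only a convenience. Whether that holds for the actual loss implementation would need to be checked against the code.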