Closed smonsays closed 2 years ago
Hey Robert, I tried to jax.lax.scan over the bsuite-MemoryChain environment and noticed that the reward returned by this env is an int32 instead of a float32 as for the other environments. Was this on purpose? If not here would be the minimal fix.
jax.lax.scan
int32
float32
Best, Simon
Thank you Simon!
Hey Robert, I tried to
jax.lax.scan
over the bsuite-MemoryChain environment and noticed that the reward returned by this env is anint32
instead of afloat32
as for the other environments. Was this on purpose? If not here would be the minimal fix.Best, Simon