matomatical / jaxgmg

JAX-based environments and RL baselines for studying goal misgeneralisation.
MIT License
2 stars 0 forks source link