Closed fehiepsi closed 4 months ago
This is useful for optimizing large parameters, like in LLM.
Also update for the new jax.tree pattern.
This is useful for optimizing large parameters, like in LLM.
Also update for the new jax.tree pattern.