kingoflolz / mesh-transformer-jax

Model parallel transformers in JAX and Haiku
Apache License 2.0
6.27k stars 890 forks

Error fine-tuning train #186

Closed by DimIsaev 2 years ago

DimIsaev commented 2 years ago

DELETED