Closed danyaljj closed 2 years ago
Thanks for sharing the checkpoints! Wondering if there is a plot of perplexity as a function of steps #?
Information collected during training (ppl, evals etc) can be seen here: https://wandb.ai/eleutherai/mesh-transformer-jax/reports/6B-Rotary--Vmlldzo2NDQxNzY
Thanks for sharing the checkpoints! Wondering if there is a plot of perplexity as a function of steps #?