Open Vishal-S-P opened 3 months ago
Hi,
I am training SEED-small model on OpenWebText dataset. After few iterations of training the loss value (eval and train) explodes and training becomes unstable. Has anyone encountered this issue before?
Hi,
I am training SEED-small model on OpenWebText dataset. After few iterations of training the loss value (eval and train) explodes and training becomes unstable. Has anyone encountered this issue before?