Closed zanussbaum closed 9 months ago
deepspeed sets the weights of the model in mixed precision, need to change precision of embeddings
deepspeed sets the weights of the model in mixed precision, need to change precision of embeddings