Open xiaomabufei opened 6 months ago
Hello, thank you for your wonderful work. May I ask how you handle using DeepSpeed together with bf16 precision? I found that bf16 does not have a dynamic loss scale.