microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
https://www.deepspeed.ai/
Apache License 2.0
33.6k stars 3.94k forks source link

Switch from torch.cuda.amp.custom_fwd to torch.amp.custom_fwd(device=...) #5684

Open loadams opened 1 week ago