DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
33.6k
stars
3.94k
forks
source link
Switch from torch.cuda.amp.custom_fwd to torch.amp.custom_fwd(device=...) #5684
Open
loadams opened 1 week ago