Open: catid-saronic opened this issue 2 weeks ago
Hi @catid-saronic, thanks for your interest in our code! We have not tested using our Shampoo code with DeepSpeed. For scaling up models, we have preliminary support for FSDP; however, this does require some model information.
If you're interested in getting things working with DeepSpeed, we would be happy to help. Let me know if you have any other questions.
Using the latest main to train a YoloV9e object detector:
Looks like there's some issue with this code when used from DeepSpeed?