Open skyshine102 opened 3 weeks ago
It certainly should, but not yet in torch; the JAX version is good to go. We're working on a distributed torch version right now.
Looking forward to the feature! I'm a torch user, and I can test it once you have an initial release.
Does the PSGD Kron optimizer work with FSDP or DeepSpeed?