Open wingrime opened 1 year ago
L-BFGS - Is second order optimizer having good convergence but is memory intensive. Offloading optimizer state with deep-speed would improve method accessibility for users constrained in GPU-RAM
@wingrime, thanks for this suggestion. Are you able to provide a PR?
L-BFGS - Is second order optimizer having good convergence but is memory intensive. Offloading optimizer state with deep-speed would improve method accessibility for users constrained in GPU-RAM