Closed ys950902 closed 3 months ago
@tjruwase would you help review?
@rogerxfeng8 @tjruwase shouldn't the ipex import be under a try/except block?
@polisettyvarma, good catch. I think the ipex import could also be under
if get_accelerator().device_name() == 'xpu':
similar to the cuda case.
@rogerxfeng8, can you please help fix. Thanks!
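For reference, the two suggestions above (a try/except around the import, plus a device check) could be combined roughly as follows. This is only a minimal sketch, not the code from the PR: the helper name load_xpu_backend and its parameters are hypothetical, and in the real code the device name would presumably come from get_accelerator().device_name() rather than being passed in.

```python
import importlib


def load_xpu_backend(device_name, module_name="intel_extension_for_pytorch"):
    """Hypothetical helper: return the imported ipex module, or None.

    device_name is assumed to be the string reported by the accelerator
    (e.g. 'cuda' or 'xpu'); module_name is a parameter only so the
    sketch can be exercised without ipex installed.
    """
    # Only attempt the import on XPU, so other devices never touch ipex.
    if device_name != "xpu":
        return None
    try:
        return importlib.import_module(module_name)
    except ImportError:
        # ipex is not installed; the caller can fall back to a
        # non-fused RMSNorm implementation.
        return None
```

A caller would then check the return value for None before enabling the fused path on XPU.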
Thanks for your suggestions, I have made the change in https://github.com/microsoft/Megatron-DeepSpeed/pull/436.
For cuda, fused_rms_norm is set by default; this adds support for fused_rms_norm on XPU devices.