Intel® Extension for DeepSpeed* is an extension to DeepSpeed that adds feature support through SYCL kernels on Intel GPU (XPU) devices. Note that XPU is already supported in stock (upstream) DeepSpeed.
MIT License
57 stars, 19 forks
Support bf16 type for transformer inference kernel to support Ds_Chat #64
Add a flag that turns on bf16 support in the transformer inference kernel, enabling DeepSpeed-Chat.