Closed yiliu30 closed 1 year ago
Hi @StochasticRomanAgeev, could you please take the time to conduct a preliminary review? thanks :)
Hi @StochasticRomanAgeev @tushar2407 @MarcosRiveraMartinez, could you please take your time to review the PR, thanks :)
Hi @yiliu30, Done merging, thanks for doing this integration!
It would be great if you could include a brief description of update into CPU inference section in the README. I intend to provide this enhancement in a subsequent PR.
It would be great if you could include a brief description of update into CPU inference section in the README. I intend to provide this enhancement in a subsequent PR.
Sure, I'll add an introduction in a follow-up PR soon :)
1st PR for https://github.com/stochasticai/xTuring/issues/264 to integrate Intel-Extension-for-Transformers to support int8 model on the CPU-only devices.
Usage
TODO
@StochasticRomanAgeev