faaany opened this PR 2 months ago (status: Open)
Hi @echarlaix, this PR is a joint effort by @jiqing-feng, @ganyi1996ppo, and me. Could you please help review it? Thanks a lot!
@yao-matrix
What does this PR do?
This PR enables Intel GPU support for Llama2 model inference in optimum-intel. Below is a code example: