A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.
GNU Affero General Public License v3.0
14
stars
8
forks
source link
Remove redundant warm-up for to optimize chat speed #21
Closed
sgwhat closed 5 months ago
Checklist: