Open aahouzi opened 1 week ago
Hi @aahouzi,
We recently updated ipex-llm for Lunar Lake (LNL) support. You could refer to here regarding how to install ipex-llm for LNL iGPU.
Besides, for the all-in-one benchmark, you could also try the test_api transformer_int4_fp16_gpu_win.
Please let us know if you run into any further problems :)
Type of issue
I conducted a benchmark on the LNL iGPU for the Arcee-lite model, which is based on the Qwen2 architecture and was obtained via LLM distillation techniques. The model runs without problems for the [6x128] and [6x256] configs, but when given large input prompts (in my case [1000x512]), it hangs without any error logs.
The same issue also occurred with the Supernova-lite model, which is based on the Llama 3.1 architecture, except that none of the [6x128], [6x256], or [1000x512] configurations worked.
The model generation step hangs exactly on this line of code:
GPU Driver version
32.0.101.5737
What operating system are you seeing the problem on?
Windows 11
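
Since the hang described above produces no error logs, one possible way to capture where the Python side is stuck is a watchdog built on the standard-library faulthandler module. The sketch below is only an illustration and is not part of ipex-llm or the all-in-one benchmark: the helper name run_with_hang_dump and the 300-second default are assumptions, and the commented model.generate call stands in for whatever generation call the benchmark actually makes.

```python
import faulthandler
import sys


def run_with_hang_dump(fn, timeout_s=300.0, log_file=sys.stderr):
    """Call fn(); if it is still running after timeout_s seconds,
    write the Python stack of every thread to log_file.

    Hypothetical helper for illustration. It does not kill the hung
    call, it only records where the interpreter is waiting.
    """
    # Arm a watchdog that dumps all thread stacks once the timeout passes.
    # log_file must be a real file object (it needs a file descriptor).
    faulthandler.dump_traceback_later(timeout_s, exit=False, file=log_file)
    try:
        return fn()
    finally:
        # Disarm the watchdog if fn() returned (or raised) in time.
        faulthandler.cancel_dump_traceback_later()


# Hypothetical usage around a generation call (names are assumptions):
# output = run_with_hang_dump(
#     lambda: model.generate(input_ids, max_new_tokens=512),
#     timeout_s=300.0,
# )
```

The dump only shows Python frames, so if the hang is inside a native GPU kernel, the trace will point at the Python call that launched it rather than at the kernel itself.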