Open dukelee111 opened 2 months ago
Hi @dukelee111,
I reproduced the issue and got the same error. "Not able to determine model policy automatically" means that GLM-4-9B-Chat is not supported by AutoTP, as shown here: it does not appear in DeepSpeed's supported model list.
Could you please confirm whether GLM-4-9B-Chat is supported? Thanks so much.
Docker image: intelanalytics/ipex-llm-serving-vllm-xpu-experiment
Tag: 2.1.0b2
Image ID: 0e20af44ad46
Steps:
1. cd /benchmark/all-in-one
2. Edit config.yaml
3. bash run-deepspeed-arc.sh
The error trace details are attached: