How can I use multiple NPUs?

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Apache License 2.0


Open QuentinWang1 opened 6 months ago

QuentinWang1 commented 6 months ago

For example, multiple GPUs can be selected like this: python3 -m fastchat.serve.cli --model-path lmsys/vicuna-7b-v1.5 --num-gpus 2

Maybe a command like the following needs to be supported? python3 -m fastchat.serve.cli --model-path lmsys/vicuna-7b-v1.5 --device npu --num-npus 2
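
To make the request concrete, here is a minimal sketch of how a per-device count option could be parsed. It is not FastChat's actual CLI code: `build_parser` and `device_count` are hypothetical names, the list of device choices is an assumption, and `--num-npus` is only the flag proposed above, assumed here to behave like `--num-gpus`.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser for illustration only; not FastChat's real definition.
    parser = argparse.ArgumentParser(description="sketch of a device-count option")
    parser.add_argument("--model-path", type=str, default="lmsys/vicuna-7b-v1.5")
    # The set of device choices below is an assumption.
    parser.add_argument("--device", type=str, choices=["cpu", "cuda", "npu"], default="cuda")
    parser.add_argument("--num-gpus", type=int, default=1)
    # Flag proposed in this issue; assumed to mirror --num-gpus.
    parser.add_argument("--num-npus", type=int, default=1)
    return parser


def device_count(args: argparse.Namespace) -> int:
    # Return the count that matches the selected device type.
    if args.device == "npu":
        return args.num_npus
    if args.device == "cuda":
        return args.num_gpus
    # CPU runs use a single device in this sketch.
    return 1
```

With this sketch, parsing `--device npu --num-npus 2` yields a device count of 2, the same as `--num-gpus 2` does for the default `cuda` device. A single shared count option would be an alternative design that avoids adding one flag per accelerator type.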

wangshuai09 commented 6 months ago