Open sjrrr13 opened 7 months ago
minimal.py does not support single-node multi-process. Please check applications/run_mtbench.sh for examples, thank you!
Maybe minimal.py can also support this. Please set torch_device="auto" in the code, do not change DIST_WORKERS=4, and just run python minimal.py
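As a rough sketch of what the torch_device="auto" suggestion amounts to, assuming minimal.py passes torch_device to Hugging Face's from_pretrained as device_map (an assumption, not confirmed in this thread): with device_map="auto" and the accelerate package installed, the model's layers are spread across the visible GPUs within a single process. The helper names load_sharded and devices_used are hypothetical.

```python
def load_sharded(model_name, torch_device="auto"):
    """Load a causal LM, letting accelerate place layers on the visible GPUs.

    Assumption: torch_device is forwarded as device_map, which may not match
    how minimal.py actually uses it.
    """
    # Imported lazily so the helper below can be used without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(
        model_name,
        torch_dtype=torch.float16,
        device_map=torch_device,  # "auto" requires the accelerate package
    )
    return model, tokenizer


def devices_used(model):
    """Return the distinct devices listed in the model's hf_device_map."""
    device_map = getattr(model, "hf_device_map", {})
    return sorted({str(device) for device in device_map.values()})


if __name__ == "__main__":
    model, _ = load_sharded("meta-llama/Llama-2-70b-hf")
    print(devices_used(model))
```

If devices_used reports more than one device, the weights are sharded rather than sitting on gpu:0 alone. Whether LADE's distributed mode works on top of this is not something the sketch checks.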
I've tried to load LADE distributively with
However, when I try to monitor GPU usage with
watch nvidia-smi
, I found that only gpu:0 was used. I want to use Llama-2-70b-hf, which cannot be loaded on a single GPU. What can I do to use all the GPUs? Is there a problem with my launch command?
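For anyone reproducing this, a scripted check can be easier to compare across runs than watching the nvidia-smi screen. The sketch below is only an illustration: it uses nvidia-smi's CSV query mode, the function names are hypothetical, and the 1024 MiB threshold for counting a GPU as "in use" is an arbitrary assumption.

```python
import subprocess

# nvidia-smi's CSV query mode: one line per GPU, no header, no units.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,memory.used,utilization.gpu",
    "--format=csv,noheader,nounits",
]


def parse_gpu_usage(csv_text):
    """Parse 'index, memory.used, utilization.gpu' lines into a dict per GPU."""
    usage = {}
    for line in csv_text.strip().splitlines():
        index, memory_mib, util_pct = (field.strip() for field in line.split(","))
        usage[int(index)] = {"memory_mib": int(memory_mib), "util_pct": int(util_pct)}
    return usage


def busy_gpus(usage, min_memory_mib=1024):
    """Return indices of GPUs holding at least min_memory_mib of memory."""
    return [i for i, stats in sorted(usage.items()) if stats["memory_mib"] >= min_memory_mib]


def query_gpu_usage():
    """Run nvidia-smi once and return the parsed usage."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_usage(result.stdout)


if __name__ == "__main__":
    print("GPUs in use:", busy_gpus(query_gpu_usage()))
```

If only index 0 is listed while the model is loading, the weights are going to a single device, which matches the symptom described here.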