Closed thirteenflt closed 4 weeks ago
what is your vllm version?
vllm version is 0.4.3 nvidia-nccl-cu12==2.20.5
vllm version is 0.4.3 nvidia-nccl-cu12==2.20.5
vllm version is 0.4.3 nvidia-nccl-cu12==2.20.5
do you use our docker container (https://github.com/OpenLLMAI/OpenRLHF/tree/main/dockerfile) and NCCL between multiple nodes (such as IB)? I recommend you try vLLM 0.42 first, because for 0.43 we didn't test it enough.
Anyone has any clue on this error?