multi-gpu error:MPI_Unknown_error for examples/apps/chat.py

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

https://nvidia.github.io/TensorRT-LLM

Apache License 2.0

8.27k stars 918 forks source link

multi-gpu error:MPI_Unknown_error for examples/apps/chat.py #2209

Open youxzAnt opened 2 weeks ago

youxzAnt commented 2 weeks ago

System Info

when i run

got

How can i fix it?

Who can help?

No response

Information

[ ] The official example scripts
[ ] My own modified scripts

Tasks

[ ] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
[ ] My own task or dataset (give details below)

Reproduction

Expected behavior

actual behavior

additional notes

lfr-0531 commented 2 days ago

I cannot reproduce this issue. Can you have a try on the latest main branch?