pytorch / TensorRT

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
https://pytorch.org/TensorRT
BSD 3-Clause "New" or "Revised" License

❓ [Question] Is there any way to deploy on a single machine with multi-gpus? #3092

Open SZ-ing opened 3 weeks ago

SZ-ing commented 3 weeks ago

❓ Question

Additional context

As the title says, I have a machine with multiple GPUs and would like to know whether there is a way to distribute a model evenly across them.

narendasan commented 3 weeks ago

Take a look at these tutorials:

Many tools can automate converting a model so that it runs across multiple GPUs, for example DeepSpeed automatic tensor parallelism (https://www.deepspeed.ai/tutorials/automatic-tensor-parallelism/) and the Hugging Face Accelerate launcher (https://huggingface.co/docs/accelerate/basic_tutorials/launch).
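To make "evenly distribute the model" concrete, here is a minimal sketch of one possible approach: assigning consecutive layers to GPUs so that each GPU holds roughly the same number of parameters. This is layer-wise placement, not the tensor parallelism DeepSpeed automates, and it is not part of the Torch-TensorRT API. The function name `contiguous_device_map` and the layer names and sizes are made up for illustration.

```python
from typing import Dict, List, Tuple


def contiguous_device_map(
    layer_params: List[Tuple[str, int]], num_devices: int
) -> Dict[str, int]:
    """Assign consecutive layers to devices with roughly equal parameter counts.

    layer_params: (layer_name, parameter_count) pairs in forward order.
    num_devices:  number of GPUs to spread the layers over.
    Returns a mapping from layer name to device index.
    """
    if num_devices < 1:
        raise ValueError("num_devices must be at least 1")

    total = sum(count for _, count in layer_params)
    device_map: Dict[str, int] = {}
    device = 0
    running = 0  # parameters placed before the current layer

    for name, count in layer_params:
        # Advance to the next device once the parameters placed so far
        # reach the cumulative share of the current device.
        while (
            device < num_devices - 1
            and running >= total * (device + 1) / num_devices
        ):
            device += 1
        device_map[name] = device
        running += count

    return device_map


# Hypothetical four-layer model with equal-sized layers.
layers = [("embed", 4), ("block0", 4), ("block1", 4), ("head", 4)]
print(contiguous_device_map(layers, num_devices=2))
```

With the hypothetical layers above and two devices, the first two layers land on device 0 and the last two on device 1. In PyTorch, each submodule would then be moved with `module.to(f"cuda:{index}")`, and the forward pass would have to move activations between devices at each boundary. The tools linked above handle that step for you.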