Closed Qubitium closed 1 year ago
Trying to fix triton model load on multi-gpu via cuda:0, cuda:1, etc. Please note this pr is not complete and does not resolve the underlying issue.
Trying to fix triton model load on multi-gpu via cuda:0, cuda:1, etc. Please note this pr is not complete and does not resolve the underlying issue.