tensorrt-llm Search Results

NVIDIA/TensorRT-LLM #2518

MPI Abort Error when using disaggServerBenchmark

### System Info - CPU: x86_64 (Ubuntu 20.04.6 LTS) - GPU: H100 * 8 - CUDA: 12.5.1 - TensorRT-LLM: The latest dev commit, 385626572df16175dd327fa785e4434cb7866a64 - TensorRT: 10.6.0 - Python: 3.10.14 …

zhangts20 updated 10 hours ago

NVIDIA/TensorRT-LLM #2494

[bug] forwardAsync assertion failed

My [version](https://github.com/NVIDIA/TensorRT-LLM/tree/31ac30e928a2db795799fdcab6be446bfa3a3998) [Assertion](https://github.com/NVIDIA/TensorRT-LLM/blob/31ac30e928a2db795799fdcab6be446bfa3a3998/cpp…

akhoroshev updated 1 week ago

NVIDIA/TensorRT-LLM #2524

pynvml version issue

### System Info Tensorrt-llm v0.14.0 ### Who can help? _No response_ ### Information - [ ] The official example scripts - [ ] My own modified scripts ### Tasks - [ ] An officially supported tas…

apbose updated 20 hours ago

NVIDIA/TensorRT-LLM #2492

Wrong output on Llama 3.2 1B, but 3B ok

### System Info Both RTX 2070 and RTX A6000 ### Reproduction I'm using the latest main (535c9cc6730f5ac999e4b1cb621402b58138f819) I'm using the `make wheel` image, from main. I built the 3B model…

lucasavila00 updated 1 week ago

k2-fsa/sherpa #674

24.09-trtllm-python-py3 using tensorrt-llm==0.15.0.dev202410150…

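Hi, following your code, I compiled and deployed the whisper-large-v3-turbo model and got the error below. As far as I can tell, the tensorrt-llm version supported by 24.09-trtllm-python-py3 is 0.13.0. Did the test succeed on your side? ``` Traceback (most recent call last): File "/workspace/TensorRT-LLM/exam…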

xqun3 updated 1 week ago

NVIDIA/TensorRT-LLM #2476

trtllm-bench fail

# trtllm-bench --model models/Llama-2-7b-hf throughput --dataset experiments/synthetic_128_128.txt --engine_dir models/Llama2-7b-trt-engine [TensorRT-LLM] TensorRT-LLM version: 0.15.0.dev2024111200 …

Wowoho updated 1 week ago

NVIDIA/TensorRT-LLM #2448

Unable to profile cpp benchmark due to NCCL error

### System Info - CPU: x86_64, Intel(R) Xeon(R) Platinum 8470 - CPU/Host memory size: 1TB - GPU: 4xH100 96GB - Libraries TensorRT-LLM: main, 0.15.0 (commit: b7868dd1bd1186840e3755b97ea3d3a73dd…

YJHMITWEB updated 2 weeks ago

NVIDIA/TensorRT-LLM #2380

Error while importing tensorrt_llm

After installation, getting error while importing tensorrt_llm: `ImportError: /opt/conda/lib/python3.10/site-packages/tensorrt_llm/bindings.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN3c106…

Aaryanverma updated 6 days ago

NVIDIA/TensorRT-LLM #2439

[bug] unnecessary batch logits post processor calls

[version](https://github.com/NVIDIA/TensorRT-LLM/tree/31ac30e928a2db795799fdcab6be446bfa3a3998) When I build model with paged_context_fmha = true and max_num_tokens = 4096, chunked context is enabled…

akhoroshev updated 2 weeks ago

NVIDIA/TensorRT-LLM #2475

undefined reference to `__libc_single_threaded'

### System Info System: - CPU Architecture: x86_64 - GPU: NVIDIA H100 - 80GB - CUDA 12.4 - TensorRT-LLM: main branch, commit 535c9cc6730f5ac999e4b1cb621402b58138f819 - Operating System: Ubuntu 22.04…

hoangvictor updated 1 week ago

1000+ results for tensorrt-llm
