-
### System Info
- CPU: x86_64
- GPU: L40
- tensorrt_llm: 0.11.0
- CUDA: 12.4
- driver: 535.129.03
- OS: CentOS 7
### Who can help?
When I tried to import tensorrt_llm, it got stuck. Through debuggi…
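One way to narrow down where the import hangs (a minimal sketch, assuming a Linux host; using `faulthandler` with `SIGUSR1` is my own debugging choice, not something from the original report):
```python
# Hypothetical debugging sketch: register a signal handler that dumps every
# Python thread's stack, then attempt the import. If `import tensorrt_llm`
# hangs, running `kill -USR1 <pid>` from another shell prints the stacks and
# shows which call (e.g. MPI or CUDA initialization) is blocking.
import faulthandler
import signal

faulthandler.register(signal.SIGUSR1, all_threads=True)

import tensorrt_llm  # the import that appears to get stuck

print(tensorrt_llm.__version__)
```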
-
## Description
Model artifacts are in the (TRT-LLM) LMI model format:
```
aws s3 ls ***
    PRE 1/
2024-10-25 14:59:…
```
-
docker.io/tensorrt_llm/release:latest
-
### System Info
System Information:
CPU architecture: x86_64
CPU/Host memory size: 2.0 TiB
GPU Properties:
GPU name: NVIDIA H100 80GB HBM3
GPU memory size: 80 GB (75016 MiB / 81559…
-
### System Info
x86_64, Debian 11, L4 GPU
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supporte…
-
### System Info
CPU: x86_64
GPU: NVIDIA A100
### Who can help?
_No response_
### Information
- [x] The official example scripts
- [ ] My own modified scripts
### Tasks
- [x] An officially suppo…
-
## Environment
- **GPUs**: 4x NVIDIA A100 (80 GB) (NVLink, Azure Standard_NC96ads_A100_v4)
- **TensorRT-LLM Version**: 0.15.0.dev2024102200
- **Environment**: Docker container
- **Memory Usage per GPU…
-
Can GroundingDINO be supported by TensorRT-LLM's multimodal pipeline?
[TensorRT-LLM multimodal](https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/multimodal/README.md)
-
TRT-LLM version: v0.11.0
I'm deploying a BART model with Medusa heads, and I noticed this issue: https://github.com/NVIDIA/TensorRT-LLM/issues/1946. I then adapted my model with the following steps:
```
1…
-
### System Info
GPU: 4090
TensorRT: 10.3
tensorrt-llm: 0.13.0.dev2024081300
### Who can help?
@Tracin Could you please take a look? Thank you very much.
### Information
- [ ] The official example sc…