-
**Problem**
TensorRT-LLM is one of the biggest features in Jan. Since Docker environments usually include everything needed to run the software with its full set of features, why not include it too…
-
## Bug Description
The Torch-TensorRT example does not generate images correctly. There were no issues with v2.3.0, but v2.4.0 produces incorrect images.
https://git…
-
### Checklist
- [X] I've read the [contribution guidelines](https://github.com/autowarefoundation/autoware/blob/main/CONTRIBUTING.md).
- [X] I've searched other issues and no duplicate issues were…
-
### System Info
A10
tensorrt-cu12-10.2.0.post1
tensorrt-cu12-bindings-10.2.0.post1
tensorrt-cu12-libs-10.2.0.post1
tensorrt_llm-0.12.0.dev2024072300
python==3.10
### Who can help?
@Tracin
…
-
### System Info
GPU: NVIDIA T4 * 4
Driver Version: 550.54.15
CUDA: 12.4
Image: nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3
TensorRT-LLM version: 0.11.0
### Who can help?
No response…
-
### Describe the issue
Inference results are abnormal when running YOLOv7 models with the TensorRT EP.
We have confirmed that the results are correct with the CPU and CUDA EPs.
The issue wa…
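To pin down how far the TensorRT EP outputs drift from the CPU/CUDA baselines, a small helper like the following can quantify the discrepancy. This is a generic sketch, not tied to any particular model or runtime: it assumes the outputs from each execution provider have been flattened into lists of floats.

```python
def max_abs_diff(outputs_a, outputs_b):
    """Largest element-wise absolute difference between paired outputs.

    outputs_a / outputs_b are lists of flat float lists (one list per
    output tensor), as you might collect from two execution providers.
    """
    return max(abs(x - y)
               for a, b in zip(outputs_a, outputs_b)
               for x, y in zip(a, b))

# Identical outputs give 0.0; any perturbation shows up in the maximum.
cpu_out = [[1.0, 2.0], [3.0]]
trt_out = [[1.0, 2.5], [3.0]]
print(max_abs_diff(cpu_out, trt_out))  # 0.5
```

A near-zero maximum points at post-processing rather than the EP itself; a large one confirms the engine is producing different tensors.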
-
## Goal
- We should have a model folder that can handle different models
- Built-in models (e.g. `janhq/llama3:7b-tensorrt-llm`)
- Huggingface GGUF repos with multiple quants (e.g. `ba…
-
### System Info
A100-PCIe-40GB
TensorRT-LLM version: 0.12.0
### Who can help?
@sunnyqgg
### Information
- [x] The official example scripts
- [ ] My own modified scripts
### Tasks
- [x] An offi…
-
### System Info
- CPU x86_64 (intel i9)
- 128G memory (RAM)
- GPU: 1 x RTXA6000
- Libraries:
- TensorRT-LLM 0.12.0 (stable)
- TensorRT 10.3.0
- transformers 4.42.4
- CUDA vers…
-
Why is the ONNX model with QDQ nodes, exported via ProgramEntrance_1.py, even slower than the FP16 TensorRT model?
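One plausible cause (an assumption, not confirmed from the issue) is that each quantize/dequantize pair the exporter inserts adds an int8 round trip that only pays off when the backend fuses the pairs into int8 kernels; left unfused, every pair is pure overhead on top of FP16. A minimal sketch of what a single QDQ pair computes (the scale value here is illustrative, not from the model):

```python
def quantize_dequantize(x, scale, zero_point=0, qmin=-128, qmax=127):
    """Simulate one ONNX QuantizeLinear + DequantizeLinear pair (int8).

    If the inference backend cannot fuse these pairs into int8 kernels,
    each pair is executed literally, which can make a QDQ model slower
    than a plain FP16 engine.
    """
    q = min(qmax, max(qmin, round(x / scale) + zero_point))  # quantize + clamp
    return (q - zero_point) * scale                          # dequantize

# 0.1234 snaps to the nearest int8 step of the illustrative scale 0.01.
print(quantize_dequantize(0.1234, scale=0.01))
```

Profiling the engine layer-by-layer would show whether the Q/DQ nodes run as standalone kernels instead of being absorbed into int8 convolutions.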