inference-engine Search Results

1000+ results
for inference-engine

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

openjournals/joss-reviews #7427

[PRE REVIEW]: FPGAI Engine for Neural Network Training and I…

**Submitting author:** @umutcanaltin (Umut Can Altin) **Repository:** https://github.com/umutcanaltin/fpgai_compiler **Branch with paper.md** (empty if default branch): main **Version:** v1.0.0 **Edit…

editorialbot updated 2 weeks ago
9
hpcaitech/ColossalAI #6112

[BUG]: ColossalAI Inference example response empty result wi…

### Is there an existing issue for this bug? - [X] I have searched the existing issues ### 🐛 Describe the bug Git commit: 2f583c1549(Current master branch) ## code(Example code in colossal…

GuangyaoZhang updated 3 weeks ago
2
ultralytics/ultralytics #17247

post-process is not batch operation?

### Search before asking - [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…

Willjay90 updated 22 hours ago
13
NVIDIA/TensorRT-LLM #802

Failing to inference multi-GPU Llama engine

**Env:** - Container: nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3 - TensorRT-LLM release: 0.7.1 - TRT-LLM backend repo tag: v0.7.1 - Model: Llama-2-70b - tritonserver deployed on 2 A10…

manarshehadeh updated 2 weeks ago
2
NVIDIA/TensorRT-LLM #2381

CUDA Out of Memory Error when Running Nemotron-51B with Tens…

## Environment - **GPUs**: 4x NVIDIA A100 (80GB) (nvlink. azure Standard_NC96ads_A100_v4) - **TensorRT-LLM Version**: 0.15.0.dev2024102200 - **Environment**: Docker container - **Memory Usage per GPU…

ShivamSphn updated 1 week ago
2
ultralytics/ultralytics #17850

Imx500 usage example error

### Search before asking - [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report. ### Ultralytics YOLO Component Expo…

Magitoneu updated 1 day ago
2
ultralytics/ultralytics #17542

Post Processing Bottleneck on Jetson AGX Orin

### Search before asking - [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…

TristanPuma1 updated 2 weeks ago
3
janhq/cortex.cpp #1745

bug: Cannot start Llama 3.2-1b-instruct

### Jan version 0.5.7-rc2-beta2024-11-28T03:35:22.905Z ### Describe the Bug Basically i downloaded that model to use with Jan but when i try to activate the model got this message on the logs: `…

vico93 updated 1 day ago
1
bitnami/charts #30649

[bitnami/vllm] feat: Add chart

### Name and Version bitnami/vllm 0.1.0 ### What is the problem this feature will solve? Add the helm chart for vllm - a high-throughput and memory-efficient inference and serving engine for …

zhekazuev updated 1 day ago
1
vllm-project/vllm #8513

[Usage]: Best engine arguments for large batch inference

### Your current environment irrelevant ### How would you like to use vllm What would be the arguments that would maximize overall throughput for large batch offline inference? More specifically, I…

alpayariyak updated 2 months ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for inference-engine

1000+ results
for inference-engine