-
### System Info
Docker image: nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3
Device: 8x H100
trt-llm backend: v0.11.0
### Who can help?
@byshiue @schetlur-nv
### Information
- [ ] The off…
-
### 🚀 The feature, motivation and pitch
Hi all, I was wondering if it's possible to do precise model device placement. For example, I would like to place the vLLM model on GPU 1 and let GPU 0 do othe…
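One common workaround (not a vLLM-specific placement API, just an assumption that the framework initializes CUDA lazily) is to restrict the devices visible to the process before any CUDA context is created:

```python
import os

# Hypothetical setup: expose only physical GPU 1 to this process.
# Any CUDA framework initialized afterwards (PyTorch, vLLM, ...)
# will see that device as cuda:0, leaving GPU 0 free for other work.
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

print(os.environ["CUDA_VISIBLE_DEVICES"])  # "1"
```

Note this must run before the first CUDA call in the process; it has no effect once the library has already initialized a context.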
-
According to `scripts/profile/inference.jl`, I got the PNG below, which shows that on my 2080 Ti / 28-core CPU machine, batch size 512 is the best, at a cost of about `1.6e-5` per example.
![inference-gp…
-
### Describe the bug
Using a Celeron J4125, I am trying to run OpenVINO, but I get
```
[Step 7/11] Loading the model to the device
[ ERROR ] Check 'false' failed at src/inference/src/core.cpp:114…
-
I confirmed that the same problem occurred when using a model with batch normalization.
Thanks for solving the problem by switching to group normalization!
1. I need torch model files to use batc…
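For illustration, here is a minimal sketch (plain NumPy with an assumed layout and group count, not the actual model code) of why group normalization avoids the batch dependence: its statistics are computed per sample, so each sample's output is independent of the rest of the batch.

```python
import numpy as np

def group_norm(x, num_groups=8, eps=1e-5):
    # x: (N, C, H, W). Mean/variance are computed per sample and per
    # group, so one sample's output never depends on the other samples.
    n, c, h, w = x.shape
    g = x.reshape(n, num_groups, c // num_groups, h, w)
    mean = g.mean(axis=(2, 3, 4), keepdims=True)
    var = g.var(axis=(2, 3, 4), keepdims=True)
    return ((g - mean) / np.sqrt(var + eps)).reshape(n, c, h, w)

rng = np.random.default_rng(0)
batch = rng.standard_normal((4, 32, 8, 8))

full = group_norm(batch)          # normalized inside a batch of 4
single = group_norm(batch[:1])    # same sample normalized alone

print(np.allclose(full[:1], single))  # True: batch-size invariant
```

Batch normalization, by contrast, averages over the batch axis, which is exactly what makes its output change with batch composition.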
-
Given a prompt, the resulting embedding is slightly different depending on whether it was computed in a batch (`batch_size > 1`) or as a single inference.
For example, computing the embeddin…
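A minimal NumPy sketch of the underlying effect (hypothetical weights, not the actual embedding model): batched and per-row matrix products may take different kernel paths and reduction orders, so their results agree only up to floating-point tolerance, not bitwise.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 16)).astype(np.float32)  # stand-in weights
x = rng.standard_normal((8, 64)).astype(np.float32)   # 8 "prompts"

batched = x @ W                             # one batched GEMM
single = np.stack([row @ W for row in x])   # one GEMV per row

# Exact equality is not guaranteed: different kernels can round
# intermediate sums differently, so compare with a tolerance.
print(np.allclose(batched, single, atol=1e-5))  # True
```

In a full transformer the discrepancy compounds across layers (and padding/masking adds further differences), which is why batched and single-inference embeddings drift apart slightly.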
-
**Description**
Triton Server crashed after some period of time running inferences using Python Backend models. The Python backend models are running TensorRT models with [mmdeploy python api](https:/…
-
## 🐛 Bug
It seems like SSIM can take values larger than 1 when computed over an epoch. I cannot reproduce this error; I only observe it in TensorBoard after training.
![image](https://github…
-
**Is your feature request related to a problem? Please describe.**
As documented [here](https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/user_guide/model_configuration.htm…
-
Error 1:
```
ERROR: PosDefException: matrix is not Hermitian; Cholesky factorization failed.
Stacktrace:
[1] non_hermitian_error()
@ StaticArrays ~/.julia/packages/StaticArrays/MSJcA/sr…