-
How can I run inference and get a .png segmentation mask on the PASCAL VOC test set, which is needed for the PASCAL VOC server submission?
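For reference, the VOC evaluation server expects indexed (palette-mode) PNGs that use the standard VOC colormap. The palette itself can be generated with the well-known bit-interleaving scheme; this is only a sketch of the palette step, and how the per-pixel class IDs come out of your model is up to you:

```python
def voc_colormap(n=256):
    """Generate the standard PASCAL VOC palette: class id -> (R, G, B)."""
    def bit(value, idx):
        return (value >> idx) & 1

    cmap = []
    for label in range(n):
        r = g = b = 0
        c = label
        # Spread the lowest three bits of the label across the high bits
        # of R, G, and B, three bits of the label at a time.
        for shift in range(7, -1, -1):
            r |= bit(c, 0) << shift
            g |= bit(c, 1) << shift
            b |= bit(c, 2) << shift
            c >>= 3
        cmap.append((r, g, b))
    return cmap
```

To write the submission files, the flattened palette can then be attached to a `uint8` class-ID array with Pillow's `Image.putpalette` on a mode-`"P"` image before saving as PNG (assuming Pillow is available).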
-
**Is your feature request related to a problem? Please describe.**
I'd like to be able to run vLLM emulating the OpenAI-compatible API, so that vLLM can be used as a drop-in replacement for ChatGPT.
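For context, "OpenAI-compatible" means the server accepts the standard chat-completions request body, so existing client code only needs its base URL swapped. A minimal sketch of such a request body (the host, port, and model name here are placeholder assumptions, not confirmed vLLM defaults):

```python
import json

# Request body for POST http://localhost:8000/v1/chat/completions
# (the path follows the OpenAI API convention; the host, port, and
# model name are placeholders for illustration only).
payload = {
    "model": "my-local-model",
    "messages": [{"role": "user", "content": "Hello!"}],
    "temperature": 0.7,
    "max_tokens": 64,
}
body = json.dumps(payload)
```

Any client that can send this shape of JSON to a different base URL would work unchanged against such a server.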
**Describe…
-
Hello,
I want to run inference with a pre-trained model in the terminal, but I don't want to run an HTTP server. How can I do that?
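As a sketch of what a server-less flow can look like, the loaded model can be wrapped in a plain read-eval loop in the terminal; `generate` below is a hypothetical callable standing in for whatever the pre-trained model's inference call is:

```python
def chat_loop(generate, read=input, write=print):
    """Read prompts from the terminal and print completions until EOF.

    `generate` is a hypothetical prompt -> completion callable wrapping
    the loaded model; `read` and `write` default to the terminal but are
    injectable so the loop is easy to test.
    """
    while True:
        try:
            prompt = read("> ")
        except EOFError:
            # Ctrl-D (or end of piped input) exits the loop cleanly.
            break
        if not prompt.strip():
            continue
        write(generate(prompt))
```

Running this directly in a script avoids any HTTP layer; piping a file of prompts into stdin also works, since EOF ends the loop.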
-
### What would you like to be added?
Currently we have this project https://github.com/sustainable-computing-io/kepler-model-server, written in Python, that does many things.
Some of that belongs i…
-
### The bug
I set "quota_immich" in the OAuth settings and also as a user attribute in Keycloak, but in Immich the quota is not set correctly on first login.
In debug mode, the log shows the claim passthr…
-
I forked your wonderful program and updated requirements.txt and a few other files so it now works with Python 3.10 and PyTorch 2.2.1.
I also fixed some of the warnings for DataLoader and the Pandas…
-
### What happened?
Version on all 3 machines: 3978 (ff252ea4). I just used `git pull` to update all 3 machines and then rebuilt with CMake. All the machines are on Ethernet. I am not sure if it is n…
-
I have a BERT model that I am trying to deploy with Triton Inference Server using the TensorRT-LLM backend, but I am getting errors:
- Docker Image: 24.03
- TensorRT-LLM: v0.8.0
Error:
+-------+-…
-
### System Info
## System Specifications
2024-11-10T21:20:44.880890Z INFO text_generation_launcher: Runtime environment:
Target: x86_64-unknown-linux-gnu
Cargo version: 1.80.1
Commit sha: https:/…
-
**Description**
PR [185](https://github.com/triton-inference-server/client/pull/185) pinned `geventhttpclient==2.0.2` due to a potential change in `ssl_context_factory` handling.
The geventhttpcli…