-
First off, I am not a C programmer, but I wanted to use server.cpp and main.cpp for inference. The two have different command-line arguments, which makes them difficult to integrate.
Both do not recognise a bo…
-
As part of last week's call, I'm raising this to request more details about the TEE inference service. Will ONNX Runtime be supported in this inference service?
-
# Forecast-implied inferences can be set to any value because ForecastElements is not filtered for duplicates
## Summary
forecast-implied inferences can be set to any value due to Foreca…
-
### Area of Improvement
Right now, if the user didn't set `QueryClient` `defaultOptions.retry` to `false`, `trpc` will automatically fall back to this `retry` property's default value (which is `4`) and igno…
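For reference, a minimal sketch of opting out of retries explicitly, assuming `@tanstack/react-query` is the source of `QueryClient` (the option names are the library's; everything else is illustrative):

```ts
import { QueryClient } from "@tanstack/react-query";

// Disable automatic query retries globally, so the client does not
// fall back to the library's built-in default retry count.
const queryClient = new QueryClient({
  defaultOptions: {
    queries: {
      retry: false,
    },
  },
});
```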
-
If all exposed functions/values in some module A have a type signature, and all types it imports are resolved (so we know they are defined), then any other module B that imports this module can star…
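As a hypothetical two-module illustration of the idea (sketched in TypeScript; the modules and names are invented for this example):

```ts
// a.ts — every export carries an explicit type signature, so its
// public interface is known without type-checking its bodies.
export function area(radius: number): number {
  return Math.PI * radius * radius;
}

// b.ts — can start type-checking against a.ts's signatures alone,
// in parallel with (or before) checking a.ts's implementation.
import { area } from "./a";
const x: number = area(2.0);
```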
-
### The Feature
To support custom input params for the Triton embedding server.
### Motivation, pitch
Currently, the input payload params of the Triton Embedding model call are fixed with the below for…
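For illustration, a hedged sketch of what a caller-customizable request to Triton's KServe v2 HTTP inference endpoint could look like. The model name `embedding_model` and input name `text_input` are assumptions standing in for the fixed payload elided above, not the actual values:

```ts
// Hypothetical request body; every name and shape here is an assumption.
const body = {
  inputs: [
    {
      name: "text_input",          // assumed input tensor name
      shape: [1],
      datatype: "BYTES",
      data: ["embed this sentence"],
    },
  ],
};

// POST to Triton's v2 inference endpoint for the (assumed) model.
const res = await fetch(
  "http://localhost:8000/v2/models/embedding_model/infer",
  {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  },
);
console.log(await res.json());
```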
-
# YOLOv8 with TensorRT & Nvidia Triton Server | VISION HONG
[https://visionhong.github.io/tools/YOLOv8-with-TensorRT-Nvidia-Triton-Server/](https://visionhong.github.io/tools/YOLOv8-with-Tenso…
-
### Motivation
The latest release of Microsoft's Phi-3 4.2B 128k-context vision model looks promising in performance, and a resource-saving one too, as it boasts just 4.2B parameters. So it would be a great f…
-
**Is your feature request related to a problem? Please describe.**
1. We would like to try parallel model execution on iGPU+DLA devices. Is it possible to run triton-inference-server on a V3NP or Ori…
-
Hi,
I've been trying to serve different Phi-3 models using the Llama.cpp server that is created by ipex's init-llama-cpp.
When I serve with this version, I have two problems:
1) The server doesn…