-
Using this tool can generate an awful lot of log noise. See below for a sample of output from just a few minutes of use. I was reworking my webpack configuration, so the output information is corr…
-
ERROR: Could not install packages due to an OSError: [WinError 5] Access Denied: 'D:\\Stability Matrix\\Packages\\ComfyUI\\venv\\Lib\\site-packages\\onnxruntime\\capi\\onnxruntime_providers_shared.dll…
-
In plumbing through the Security Assistant Knowledge Base API integration tests (https://github.com/elastic/kibana/pull/192665), I ended up having to add support for a `modelId` override to our API's …
-
Are there docs on best practices for using vllm hosted models?
I start the model server with
python -m vllm.entrypoints.openai.api_server --model model_path
and then try running the evaluation as
lm_eval --model lo…
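Before wiring lm_eval up, it can help to confirm the vLLM server is actually reachable through its OpenAI-compatible endpoint. A minimal sketch, assuming the default localhost:8000 address and that the served model is registered under the same name passed to `--model` (both assumptions to adjust for your setup):

```python
# Sanity-check the vLLM OpenAI-compatible server before running lm_eval.
# Assumes the default host/port (localhost:8000); "model_path" must match the
# name the server reports for the served model.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# The evaluation harness has to be pointed at one of the names listed here.
for m in client.models.list().data:
    print(m.id)

# A tiny completion confirms the endpoint responds end to end.
resp = client.completions.create(model="model_path", prompt="Hello", max_tokens=8)
print(resp.choices[0].text)
```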
-
Currently, I'm using fastchat==0.2.36 and vllm==0.4.3 to deploy a Qwen model as an inference service. Here are the commands for starting the service on my two servers.
server1:
`python3.9 -m fastchat.serve…
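Once both servers are up, one quick way to confirm the Qwen deployment is visible to clients is to query the OpenAI-compatible model list. A small sketch, assuming FastChat's openai_api_server is part of the stack and listens on port 8000 of server1 (host and port here are placeholders):

```python
# Check which models the FastChat OpenAI-compatible endpoint reports before
# sending traffic to it. Host and port below are placeholders; point them at
# wherever fastchat.serve.openai_api_server is running in your setup.
import requests

resp = requests.get("http://server1:8000/v1/models", timeout=10)
resp.raise_for_status()
for model in resp.json().get("data", []):
    print(model["id"])
```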
-
### What happened?
In the proxy admin UI (v1.44.23 stable), I added an invalid model by mistake*, and now I'm getting constant error messages in the logs with no way I can see to stop them.
The er…
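One possible workaround, if this proxy version exposes LiteLLM's model-management endpoints (worth confirming against the docs for v1.44.x), is to delete the bad entry over the API instead of the UI. A sketch under those assumptions:

```python
# Sketch, not verified against v1.44.23: remove the misconfigured model via the
# proxy's model-management endpoints. The proxy URL, the /model/info and
# /model/delete routes, and the master-key auth are assumptions to check
# against the LiteLLM docs for your version.
import os
import requests

PROXY_URL = "http://localhost:4000"  # adjust to your proxy
HEADERS = {"Authorization": f"Bearer {os.environ['LITELLM_MASTER_KEY']}"}

# List deployed models to find the id of the invalid entry.
info = requests.get(f"{PROXY_URL}/model/info", headers=HEADERS, timeout=10)
info.raise_for_status()
for m in info.json().get("data", []):
    print(m.get("model_name"), m.get("model_info", {}).get("id"))

# Delete the bad entry by id once identified.
bad_id = "REPLACE-WITH-ID-FROM-LISTING"
resp = requests.post(
    f"{PROXY_URL}/model/delete",
    headers=HEADERS,
    json={"id": bad_id},
    timeout=10,
)
resp.raise_for_status()
```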
-
deployment.yaml
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: tensorrt-inference-server
  labels:
    app: tensorrt-inference-server
spec:
  replicas: 1
  selector:
    mat…
-
I REALLY NEED YOUR HELP. I REALLY NEED RoseTTAFold-All-Atom. After completing the installation steps, I got an error message during runtime. I tried to change the path to an absolute path in "protein.y…
-
Traceback (most recent call last):
File "/home/lhs/project/nerf...wu/depth-fm-main/inference.py", line 113, in
main(args)
File "/home/lhs/project/nerf...wu/depth-fm-main/inference.py", lin…
-
### Feature request
It would be immensely useful to have a server application to serve up HF Transformers and other Hub models as a service, similar to how `llama.cpp` bundles the `llama-server`…
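To make the request concrete, here is a minimal sketch of the kind of thing meant, assuming FastAPI plus a Transformers text-generation pipeline; the endpoint path, request schema, and model choice are illustrative, not an existing Transformers API:

```python
# Illustrative sketch only: a tiny HTTP wrapper around a Transformers pipeline,
# roughly analogous to what llama-server does for llama.cpp. The endpoint path,
# request schema, and model id are assumptions, not an existing HF API.
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

app = FastAPI()
generator = pipeline("text-generation", model="gpt2")  # any Hub model id

class GenerateRequest(BaseModel):
    prompt: str
    max_new_tokens: int = 64

@app.post("/generate")
def generate(req: GenerateRequest):
    out = generator(req.prompt, max_new_tokens=req.max_new_tokens)
    return {"text": out[0]["generated_text"]}

# Run with: uvicorn server:app --port 8080
```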