-
Whenever I try to load 4-bit models I receive this message. I'm using the latest version of the code and can load normal models just fine. I'm using a 6600 XT.
```
DEVICE ID | LAYERS | DEVICE NAM…
-
How can I deploy this across multiple GPUs on a single machine? Could you provide a script?
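Since the post does not name a framework, the sketch below shows one common single-machine multi-GPU setup, assuming a Hugging Face causal LM and the `accelerate` package; the model name is a placeholder, not something from the original question.

```python
# Minimal sketch, assuming a Hugging Face causal LM with accelerate installed.
# device_map="auto" shards the model's layers across all visible GPUs on one machine.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "your/model-name"  # placeholder; the original post does not specify a model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```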
-
#### Description
I am currently working on deploying the Seamless M4T model for text-to-text translation on a Triton server. I have successfully exported the `text.encoder` to ONNX and traced it …
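For readers following along, an export of this kind generally takes the shape sketched below. It uses a toy `torch.nn.TransformerEncoder` as a stand-in for Seamless M4T's text encoder, and the input names, shapes, and opset are illustrative assumptions rather than the poster's actual settings.

```python
# A minimal, self-contained sketch of exporting an encoder module to ONNX for
# Triton serving. The toy encoder stands in for the real text encoder.
import torch
import torch.nn as nn

class ToyTextEncoder(nn.Module):
    def __init__(self, vocab_size=256, d_model=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)

    def forward(self, input_ids):
        return self.encoder(self.embed(input_ids))

model = ToyTextEncoder().eval()
dummy_ids = torch.randint(0, 256, (1, 16), dtype=torch.long)

torch.onnx.export(
    model,
    (dummy_ids,),
    "text_encoder.onnx",
    input_names=["input_ids"],
    output_names=["last_hidden_state"],
    dynamic_axes={  # let Triton vary batch and sequence length at runtime
        "input_ids": {0: "batch", 1: "seq"},
        "last_hidden_state": {0: "batch", 1: "seq"},
    },
    opset_version=17,
)
```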
-
I've discovered that vLLM is only available for Linux systems. Currently my h2oGPT setup has caused a backlog of requests on our Windows-based system, since requests are processed in a queue one at a …
-
Installation Guide for the Riffusion App & Inference Server on Windows. After running **python -m riffusion.server --port 3013 --host 127.0.0.1**:
> ╭─────────────────────────────── Traceback (most r…
-
Updating my Yomininja results in the program no longer starting. I saw a similar issue, but I can't really read it, so I have no idea whether it's related to that OCR engine; I use Lens.
```
PS C:…
-
Hello, I have launched opt-125M inference and am sending requests to the server with Locust, but no matter how I configure `max_batch_size`, the InferenceEngine always runs with batch_size = 1. How can I use the dynam…
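Dynamic batching can only form batches when requests actually overlap in time, so the load test needs many concurrent users with little think time. Below is a minimal Locust sketch; the `/generate` path and JSON payload are assumptions, since the original post does not show them.

```python
# locustfile.py - a minimal sketch for driving concurrent requests at an
# inference server. The endpoint path and payload shape are assumptions.
from locust import HttpUser, task, between

class InferenceUser(HttpUser):
    wait_time = between(0.01, 0.05)  # near-zero think time so requests overlap

    @task
    def generate(self):
        self.client.post(
            "/generate",
            json={"prompt": "Hello, world", "max_new_tokens": 32},
        )
```

Running it with many simulated users (for example `locust -f locustfile.py --host http://localhost:PORT -u 64 -r 16`) keeps dozens of requests in flight; with a single user the engine will naturally see batch_size = 1 regardless of `max_batch_size`.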
-
I am using lightLDA to run inference on new documents. I converted the new/unseen documents to a libsvm file using the old vocabulary dictionary and generated the data block; then I read the model files server_0_table_0 and server_0…
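For context, converting an unseen document with an existing vocabulary usually amounts to mapping each token to its old word id and emitting `id:count` pairs in libsvm style. The sketch below is an illustrative assumption of that step, not lightLDA's own tooling, and the exact label/field conventions expected by the data-block generator may differ.

```python
# Minimal sketch: turn a tokenized document into a libsvm-style line using an
# existing (old) vocabulary dictionary.
from collections import Counter

def doc_to_libsvm(tokens, vocab, doc_id=0):
    # Count only tokens present in the old vocabulary; unseen words are dropped,
    # since the trained topic-word table has no rows for them.
    counts = Counter(vocab[t] for t in tokens if t in vocab)
    pairs = " ".join(f"{word_id}:{cnt}" for word_id, cnt in sorted(counts.items()))
    return f"{doc_id}\t{pairs}"

vocab = {"machine": 0, "learning": 1, "topic": 2, "model": 3}  # old dictionary (word -> id)
print(doc_to_libsvm(["machine", "learning", "topic", "topic"], vocab))
# -> "0\t0:1 1:1 2:2"
```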
-
My server cannot connect to the Hugging Face website, so I manually downloaded the pretrained model used in the code and placed it in the `img2img-turbo-main` folder. After executing the command `pyth…
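When the Hub is unreachable, a common workaround (assuming the code loads weights through Hugging Face `from_pretrained` calls) is to point those calls at the local folder and force offline mode. The path and model class below are illustrative assumptions, not taken from img2img-turbo's actual code.

```python
# Minimal sketch of loading pretrained weights from a local directory with no
# network access. The local path and model class are assumptions.
import os

# Tell huggingface_hub / transformers not to attempt any network calls.
os.environ["HF_HUB_OFFLINE"] = "1"

from transformers import AutoModel

model = AutoModel.from_pretrained(
    "./img2img-turbo-main/pretrained_model",  # assumed local folder layout
    local_files_only=True,                     # fail fast instead of hitting the Hub
)
```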
-
I am deploying Example 1: [Using Joint Inference Service in Helmet Detection Scenario](https://github.com/kubeedge/sedna/blob/main/examples/joint_inference/helmet_detection_inference/README.md).
edge…