-
### 🚀 The feature
How to deploy a model service that spans multiple GPUs?
### Motivation, pitch
I have a large model which I run via `torchrun`. I use the **FairScale** library to distribute the mo…
-
I exported a PyTorch model (`model.pt`) to ONNX:
```
def to_numpy(tensor):
    # Detach before moving to CPU when gradients are being tracked
    return tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy()

torch_model = torch.load(os.pa…
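# A runnable sketch of the rest of the export flow, with assumptions:
# nn.Linear(4, 2) stands in for the truncated torch.load(...) call above,
# and the input/output names are illustrative. torch.onnx.export itself
# is the standard PyTorch export API.
import torch
import torch.nn as nn

torch_model = nn.Linear(4, 2)        # placeholder for the loaded model.pt
torch_model.eval()
dummy_input = torch.randn(1, 4)      # example input with the expected shape
torch.onnx.export(
    torch_model,
    dummy_input,
    "model.onnx",                    # output file
    input_names=["input"],
    output_names=["output"],
)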
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
Create a simple Flask app with the following endpoints:
- [ ] **/predict** - predict the shot’s probability of being a goal given the inputs
- Input: the input features to your model, compatible wit…
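A minimal sketch of the `/predict` endpoint could look like the following. Note that `predict_proba` here is a hypothetical stand-in that returns a constant probability; the real model and its feature schema come from the assignment.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def predict_proba(features):
    # Hypothetical stand-in for the trained model: returns a fixed
    # goal probability regardless of the input features.
    return 0.5

@app.route("/predict", methods=["POST"])
def predict():
    features = request.get_json()  # input features for the model, as JSON
    proba = predict_proba(features)
    return jsonify({"goal_probability": proba})
```

Assuming the file is saved as `app.py`, it can be served locally with `flask --app app run`.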
-
* research key serving frameworks that are optimized for diffusion models
* diffusion models typically take a while to run even on GPUs, so see if we can figure out a way to test and deploy them…
-
Hi, thanks for the great article.
Could you help me with how to save the estimator for serving purposes?
-
Hello guys,
First of all, I would like to thank everyone for this amazing work. If I want to use the Jasper model with TensorFlow Serving, how should I use the data layer code for extracting features …
-
**Describe the bug**
It's frustrating to run, e.g., `ilab -v data generate` only to get:
```
...
DEBUG 2024-09-12 13:23:22,053 instructlab.model.backends.vllm:205: vLLM serving command is: ['/opt/…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### W…
-
https://arxiv.org/pdf/1905.13348.pdf
https://dl.acm.org/citation.cfm?id=3321443