-
### 🐛 Describe the bug
This documented case is not working: https://kserve.github.io/website/0.11/modelserving/v1beta1/torchserve/#deploy-pytorch-model-with-v2-rest-protocol.
The isvc object is ready when usin…
-
Hi there,
I'm trying to deploy an endpoint that has bursts of high load. I'd like the endpoint to batch requests so we can increase throughput under high load at the cost of a slight increase in l…
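TorchServe supports dynamic batching per model, configured in `config.properties`. A minimal sketch of such an entry follows; the model name `my_model` and the specific values here are illustrative assumptions, not taken from this issue:

```properties
# Illustrative config.properties fragment enabling dynamic batching.
# "batchSize" caps how many queued requests are grouped into one inference call;
# "maxBatchDelay" (ms) bounds how long a partial batch waits before being dispatched.
load_models=my_model.mar
models={\
  "my_model": {\
    "1.0": {\
        "defaultVersion": true,\
        "marName": "my_model.mar",\
        "batchSize": 8,\
        "maxBatchDelay": 50,\
        "responseTimeout": 120\
    }\
  }\
}
```

With batching enabled, the handler receives a list of requests per call and must return exactly one response per request, in order.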
-
Provide Pros, Cons, and final recommendation(s)
https://docs.bentoml.org/en/latest/
-
## 🚀 The feature
- Reduce TorchServe CPU Image size by 25% using slim as the base image
- Refactor the TorchServe Dockerfile to support slim-based CPU & GPU Docker images and set up Docker CI GitHub ac…
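A minimal sketch of what a slim-based CPU image could look like (base image tag, package versions, and the choice of JRE are assumptions, not the proposed official Dockerfile):

```dockerfile
# Hypothetical slim-based CPU image for TorchServe; illustrative only.
FROM python:3.10-slim

# TorchServe's frontend runs on the JVM, so a headless JRE is required at runtime.
RUN apt-get update \
    && apt-get install -y --no-install-recommends openjdk-17-jre-headless \
    && rm -rf /var/lib/apt/lists/*

# CPU-only torch wheels keep the image considerably smaller than the default CUDA wheels.
RUN pip install --no-cache-dir torch --index-url https://download.pytorch.org/whl/cpu \
    && pip install --no-cache-dir torchserve torch-model-archiver

EXPOSE 8080 8081
CMD ["torchserve", "--start", "--model-store", "/home/model-server/model-store", "--foreground"]
```

Most of the size reduction comes from the slim base and the CPU-only wheel index; pinning exact versions would be needed for reproducible CI builds.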
-
I have the following TorchServe handler and Dockerfile, but I'm getting a "prediction failed" error:
```python
from ts.torch_handler.base_handler import BaseHandler
from transformers import AutoModelWithLMHead, Auto…
```
-
We have two ONNX models deployed on a GPU machine built on top of the nightly Docker image.
- The first model runs with 0 failure at 500 QPS (p99 latency < 8ms) during a 2-hour perf test.
- The seco…
-
Currently, the startup code repackages the model contents in `environment.model_dir` into TorchServe (TS) format using the TS model archiver: https://github.com/aws/sagemaker-pytorch-inference-toolkit/blob/mas…
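For reference, the repackaging step corresponds roughly to an archiver invocation like the one below; the model name, file names, and paths are placeholders, not values taken from the toolkit:

```
# Illustrative torch-model-archiver invocation; names and paths are hypothetical.
torch-model-archiver \
  --model-name my_model \
  --version 1.0 \
  --serialized-file model.pth \
  --handler handler.py \
  --export-path /opt/ml/model \
  --force
```

The resulting `.mar` archive is what TorchServe loads from its model store at startup.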
-
### 🐛 Describe the bug
The CPU launcher doesn't work.
### Error logs
```
2024-03-28T08:00:31,792 [DEBUG] W-9000-embeddings_1.1 org.pytorch.serve.wlm.WorkerLifeCycle - launcherAvailable cmdline: […
```
-
### 🚀 The feature
Integrate https://github.com/libffcv/ffcv for accelerated image decoding, preprocessing, and loading.
### Motivation, pitch
I maintain [torchserve](https://github.com/pytorch/…
-
**Is your enhancement request related to a problem? Please describe.**
Over the last six months, we have been building towards packaging the MONAI model in different flavors. The focus of these effor…