-
**Description**
I was unable to build the onnxruntime_backend with OpenVINO for Triton Inference Server r22.03 using compatible ONNXRuntime and TensorRT versions (from the Triton Inference Server compati…
-
I have created and trained a model whose directory structure looks something like this:
![image](https://github.com/Azure/azure-sdk-for-python/assets/52820564/c099259a-9cd0-4935-b7f6-4784b531c966)
The ma…
-
The inference.memgpt.ai endpoint fails with the following error:
File "/usr/local/lib/python3.11/site-packages/requests/models.py", line 1024, in raise_for_status
raise HTTPError(http_error_msg, response=self)
requests.…
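For context, `raise_for_status()` only surfaces the HTTP status line; the server's actual error payload lives on the response object. A minimal sketch for seeing it (the URL and payload below are placeholders, not the real MemGPT request):
```
import requests

# Placeholder endpoint and body, only to show where the server's error text lives.
url = "https://inference.memgpt.ai/"  # placeholder
try:
    resp = requests.post(url, json={"example": "payload"}, timeout=30)
    resp.raise_for_status()
except requests.exceptions.HTTPError as e:
    # The exception message only carries the status line (e.g. "403 Client Error");
    # the response body usually carries the server's real reason.
    print(e.response.status_code)
    print(e.response.text)
```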
-
### 🐛 Describe the bug
Consider a system where a feature service fetches model metadata that has information on which features to fetch before finally inferring from the model. In order for me to fetch this me…
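A rough sketch of the flow being described, just to make the setup concrete (every name here is hypothetical, not taken from the report):
```
import torch

def fetch_model_metadata(model_name: str) -> dict:
    # Hypothetical metadata-service call: returns which features the model expects.
    return {"features": ["feature_a", "feature_b"]}

def fetch_features(feature_names: list) -> torch.Tensor:
    # Hypothetical feature-store lookup.
    return torch.randn(1, len(feature_names))

def infer(model: torch.nn.Module, model_name: str) -> torch.Tensor:
    meta = fetch_model_metadata(model_name)
    features = fetch_features(meta["features"])
    with torch.no_grad():
        return model(features)
```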
-
### Describe the bug
I've gone through all the steps to install Sora, and at the last step, running gradio/app.py, it fails about 2/3 of the way through. It hangs on loading shards at 0% and then I get the follow…
-
### Operating System
Windows
### Version Information
Recently we discovered a problem caused by an error in the DNN scoring script file. **Please see the workaround in the Additional information sec…
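Since the concrete error and workaround are cut off above, this is the general shape an Azure ML scoring script is expected to have; a minimal sketch, assuming a PyTorch model saved as `model.pt` (both are assumptions, not the actual script):
```
import json
import os
import torch

model = None

def init():
    # Azure ML calls init() once when the scoring container starts.
    # AZUREML_MODEL_DIR points at the registered model's files.
    global model
    model_dir = os.getenv("AZUREML_MODEL_DIR", ".")
    model = torch.load(os.path.join(model_dir, "model.pt"))  # assumed file name/framework
    model.eval()

def run(raw_data):
    # Azure ML calls run() once per request; raw_data is the JSON body as a string.
    data = json.loads(raw_data)["data"]
    with torch.no_grad():
        output = model(torch.tensor(data, dtype=torch.float32))
    return {"result": output.tolist()}
```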
-
### 🐛 Describe the bug
While trying to run load tests with the latest merged changes to the v2 Open Inference Protocol, I noticed that the mnist example does not work in the preprocessing step. https://git…
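The linked example is cut off above, but for reference, a v2 (Open Inference Protocol) request built from a 28x28 grayscale digit looks roughly like the sketch below; the tensor name, shape layout, host, and model name are assumptions, not the example's exact values:
```
import json
import numpy as np
import requests
from PIL import Image

# Preprocess a single 28x28 grayscale MNIST digit into a flat FP32 list.
img = np.array(Image.open("digit.png").convert("L"), dtype=np.float32) / 255.0

payload = {
    "inputs": [
        {
            "name": "input-0",        # assumed tensor name
            "shape": [1, 1, 28, 28],  # assumed NCHW layout
            "datatype": "FP32",
            "data": img.reshape(-1).tolist(),
        }
    ]
}

resp = requests.post(
    "http://localhost:8080/v2/models/mnist/infer",  # assumed host and model name
    data=json.dumps(payload),
)
print(resp.json())
```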
-
This would be for SIL Converters.
-
**Description**
Currently the Triton server doesn't capture the full serialized gRPC error message in the message field.
proto: https://github.com/triton-inference-server/common/blob/1df32b982a6ed11ead…
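A minimal client-side sketch (model and tensor names are made up) showing where the truncated message surfaces when a request fails:
```
import numpy as np
import tritonclient.grpc as grpcclient
from tritonclient.utils import InferenceServerException

client = grpcclient.InferenceServerClient("localhost:8001")

# Hypothetical model and input tensor, only to trigger and inspect an error.
inp = grpcclient.InferInput("INPUT0", [1, 16], "FP32")
inp.set_data_from_numpy(np.zeros((1, 16), dtype=np.float32))

try:
    client.infer("my_model", inputs=[inp])
except InferenceServerException as e:
    # Per this report, only part of the serialized gRPC error ends up here.
    print(e.message())
```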
-
As described, I am trying to spin up `mistralai/Mistral-7B-v0.1` using the examples in the README. This is on an EC2 `g5.xlarge`.
```
import mii
client = mii.serve("mistralai/Mistral-7B-v0.1")
resp…