-
**Description**
When deploying an ONNX model using the Triton Inference Server's ONNX runtime backend, the inference performance on the CPU is noticeably slower compared to running the same model usi…
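A direct onnxruntime benchmark on the CPU is a useful baseline for this comparison, since it isolates the model's own cost from any backend overhead. A minimal sketch follows; the model path, input name fetch, and input shape are assumptions, not taken from the report:
```py
import time

import numpy as np
import onnxruntime as ort

# Hypothetical model path and input shape -- substitute your own.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
input_name = session.get_inputs()[0].name
dummy = np.random.rand(1, 3, 224, 224).astype(np.float32)

# Warm up, then time repeated runs for a stable CPU latency baseline.
for _ in range(10):
    session.run(None, {input_name: dummy})

runs = 100
start = time.perf_counter()
for _ in range(runs):
    session.run(None, {input_name: dummy})
print(f"mean latency: {(time.perf_counter() - start) / runs * 1000:.2f} ms")
```
Comparing this number against the latency reported by Triton's perf tools for the same model makes it clearer whether the slowdown comes from the backend or from the model itself.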
-
**LocalAI version:**
quay.io/go-skynet/local-ai:master-cublas-cuda12-ffmpeg
**Environment, CPU architecture, OS, and Version:**
Intel i7 (12 cores), NVIDIA GTX 1060, 30 GB RAM
services:
  api:
    …
-
### Description
I am working with ModelMesh Serving deployed on a Kubernetes cluster, and I am looking for a way to control the number of replicas for a specific model. My setup includes a Triton runt…
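For context, ModelMesh generally scales runtime pods rather than individual models, so the closest existing knob is the `replicas` field on the ServingRuntime. A hedged sketch using the official kubernetes Python client to patch it; the CRD group/version, namespace, and runtime name below are assumptions to verify against your cluster:
```py
from kubernetes import client, config

# Load kubeconfig (use config.load_incluster_config() inside a pod).
config.load_kube_config()

api = client.CustomObjectsApi()

# Assumed CRD coordinates for the KServe/ModelMesh ServingRuntime; check with
# `kubectl api-resources | grep servingruntime`.
api.patch_namespaced_custom_object(
    group="serving.kserve.io",
    version="v1alpha1",
    namespace="modelmesh-serving",
    plural="servingruntimes",
    name="triton-2.x",                # hypothetical runtime name
    body={"spec": {"replicas": 3}},   # scales runtime pods, not per-model copies
)
```
Note that this scales the pods that host many models at once; it does not pin a replica count to one model alone.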
-
Task: Implement a web service (API) that exposes the capabilities of the LLaMA language model to perform code reviews (see the sketch below). This involves:
- [ ] Model preparation: convert the model L…
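As a starting point for the service layer (not the issue's actual design), the sketch below wires a code-review prompt to a local LLaMA model through FastAPI; it assumes llama-cpp-python and a converted model at a hypothetical `./models/llama.gguf` path:
```py
from fastapi import FastAPI
from llama_cpp import Llama
from pydantic import BaseModel

app = FastAPI()
# Assumed path to the converted model; adjust to your deployment.
llm = Llama(model_path="./models/llama.gguf", n_ctx=4096)

class ReviewRequest(BaseModel):
    code: str

@app.post("/review")
def review(req: ReviewRequest):
    # Build a simple review prompt around the submitted code.
    prompt = (
        "You are a code reviewer. Point out bugs, style issues, and risks "
        "in the following code:\n\n" + req.code + "\n\nReview:"
    )
    out = llm(prompt, max_tokens=512, stop=["</s>"])
    return {"review": out["choices"][0]["text"]}
```
Run it with `uvicorn app:app` and POST JSON like `{"code": "..."}` to `/review`.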
-
I exported a PyTorch model (model.pt) to ONNX:
```py
def to_numpy(tensor):
    return tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy()

torch_model = torch.load(os.pa…
```
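For reference, a complete export along these lines typically looks like the sketch below; the file paths, input shape, and dynamic-axes choice are assumptions, not taken from the truncated snippet:
```py
import os

import numpy as np
import onnxruntime as ort
import torch

def to_numpy(tensor):
    return tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy()

# Hypothetical path and input shape -- substitute your own.
torch_model = torch.load(os.path.join("weights", "model.pt"), map_location="cpu")
torch_model.eval()

dummy = torch.randn(1, 3, 224, 224)
torch.onnx.export(
    torch_model,
    dummy,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},  # variable batch size
    opset_version=13,
)

# Sanity check: the exported graph should match the PyTorch output.
sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
ort_out = sess.run(None, {"input": to_numpy(dummy)})[0]
np.testing.assert_allclose(to_numpy(torch_model(dummy)), ort_out, rtol=1e-3, atol=1e-5)
```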
-
tf2onnx>=1.15 pins protobuf~=3.20.2.
TensorFlow >=2.13 requires tf2onnx>=1.15 due to https://github.com/onnx/tensorflow-onnx/pull/2215.
In order to use gRPC natively with M1/M2 chips, we need at…
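A quick way to see which side of these pins an environment actually resolved to is to read the installed versions at runtime; a minimal sketch (the package set simply mirrors the constraints above):
```py
from importlib.metadata import version

# Print the packages whose pins interact; compare against
# tf2onnx>=1.15 / protobuf~=3.20.2 from the report above.
for pkg in ("tensorflow", "tf2onnx", "protobuf"):
    try:
        print(f"{pkg}: {version(pkg)}")
    except Exception:
        print(f"{pkg}: not installed")
```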
-
Deployed on a Linux server, without modifying the dependencies in requirements.txt; after running, voice cloning fails with the following error:
Traceback (most recent call last):
  File "/root/miniconda3/envs/cosyvoice/lib/python3.8/threading.py", line 932, in _bootstrap_inner
    self.…
-
Can I get help on how to run with a dynamic shape input in Python? Can you add an example in Python?
```py
import cv2
import tritonclient.grpc as grpc_client
import time
import sys
sys.path.appe…
```
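A minimal sketch of what such an example might look like, assuming the model's config declares variable dimensions (e.g. `dims: [3, -1, -1]`); the server URL, model name, tensor names, and dtype below are hypothetical:
```py
import numpy as np
import tritonclient.grpc as grpc_client

client = grpc_client.InferenceServerClient(url="localhost:8001")

def infer(image: np.ndarray) -> np.ndarray:
    # With a dynamic-shape model, the InferInput shape is just the shape of
    # this particular request's tensor; it can differ from call to call.
    data = image.astype(np.float32)
    inp = grpc_client.InferInput("input", list(data.shape), "FP32")
    inp.set_data_from_numpy(data)
    out = grpc_client.InferRequestedOutput("output")
    result = client.infer(model_name="my_model", inputs=[inp], outputs=[out])
    return result.as_numpy("output")

# Two different spatial sizes against the same model.
print(infer(np.random.rand(1, 3, 224, 224)).shape)
print(infer(np.random.rand(1, 3, 320, 320)).shape)
```
The key point is that `InferInput` takes the concrete shape of each request, so consecutive calls can send differently sized tensors as long as they satisfy the model's `-1` dimensions.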
-
**LocalAI version:**
`quay.io/go-skynet/local-ai:v1.20.0-cublas-cuda12-ffmpeg`
**Environment, CPU architecture, OS, and Version:**
Linux 0f37a61ebb06 5.10.16.3-microsoft-standard-WSL2…