-
https://docs.google.com/presentation/d/1jxj9zjeRRu1BJSf8tzaWoQVr5h7HZbOUsSh39Rcwv80/edit#slide=id.g10be2c57ddf_7_3
Aha! Link: https://nvaiinfa.aha.io/features/MERLIN-672
-
-
### Proposal to improve performance
Improve bitsandbytes quantization inference speed
### Report of performance regression
I'm testing llama-3.2-1b on a toy dataset. For offline inference using the…
-
**Problem Statement**
The SDK currently requires users to create specific object types (like EndpointCoreConfigInput, AiGatewayConfig, RateLimit, EndpointTag) when e.g. creating a serving endpoint (s…
-
Hi,
Thank you for your awesome repository, it helps me so much on my personal project :100: :+1:
I created this issue just to share my code for serving a model with Tensorflow Serving's gRPC. H…
-
Provide Pros, Cons, and final recommendation(s)
https://docs.bentoml.org/en/latest/
-
SQLFlow extends the SQL syntax to describe the end-to-end machine learning pipeline.
The end-to-end solution includes the model serving. The data transformation logic is consistent between training a…
-
### System Info
Meets requirements.txt; NVIDIA GeForce GPU.
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### 🐛 Describe the bug
I'm using the remote::vllm s…
-
**What happened**:
When submitting a job following the README example, the pod stays Pending whenever nvidia.com/gpu is set to a value greater than 1.
With nvidia.com/gpu set to 1, the pod schedules normally.
**What you expected to happen**:
The pod should schedule normally when nvidia.com/gpu is greater than 1.
**How to reproduce it (as minim…
-
### 🚀 The feature, motivation and pitch
This paper might be of interest: https://arxiv.org/pdf/2305.05920.pdf
This paper improves inference efficiency by determining the priority of each inference…