-
## Bug Description
I'm trying to serve a Torch-TensorRT optimized model on NVIDIA Triton Inference Server based on the provided tutorial:
https://pytorch.org/TensorRT/tutorials/serving_torch_tensorrt_with_t…
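For context, a Triton model repository entry for a TorchScript/Torch-TensorRT model is described by a `config.pbtxt` like the sketch below; the model name, tensor names, shapes, and datatypes here are illustrative assumptions, not values taken from the tutorial:

```
# config.pbtxt — illustrative sketch only; names and shapes are assumptions
name: "resnet50_trt"
platform: "pytorch_libtorch"
max_batch_size: 8
input [
  {
    name: "input__0"
    data_type: TYPE_FP32
    dims: [ 3, 224, 224 ]
  }
]
output [
  {
    name: "output__0"
    data_type: TYPE_FP32
    dims: [ 1000 ]
  }
]
```

The serialized TorchScript module then lives next to this file under a numbered version directory (e.g. `1/model.pt`).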
-
### Describe the feature
A model served with Mosec is sometimes an intermediate representation in a larger model pipeline, so compression support could be important.
### Why do you need this feature?
…
-
I am encountering an issue when evaluating Bitsandbytes 4-bit and 8-bit quantized models on the Berkeley Function Call Leaderboard (BFCL). I have successfully quantized my models using Bitsandbytes an…
-
### Bug Description
While working on the `net-istio-webhook` extension rock for Knative, we encountered a problem: we can't run rocks in a `securityContext.runAsNonRoot: true` Kubernetes deploym…
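For reference, the Kubernetes setting in question looks like the sketch below (the pod and image names are illustrative). With `runAsNonRoot: true`, the kubelet refuses to start any container whose effective user would be UID 0, which is why an image that defaults to root fails under this policy:

```yaml
# Illustrative manifest — names and UID are assumptions
apiVersion: v1
kind: Pod
metadata:
  name: net-istio-webhook-example
spec:
  containers:
    - name: webhook
      image: example/net-istio-webhook:latest
      securityContext:
        runAsNonRoot: true   # kubelet rejects containers that resolve to UID 0
        runAsUser: 1000      # an explicit non-root UID satisfies the check
```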
-
"The file serving/model_request_processor.py imports torch, but torch is missing from serving/requirements.txt"
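A minimal fix would be to declare the dependency explicitly in the requirements file; the version bound below is an assumption, not a pin taken from the project:

```
# serving/requirements.txt — add the missing dependency
torch>=2.0   # version specifier is an assumption; match the project's tested version
```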
-
### Main idea
Since the model service runs based on the `model-definition.yml` file, we need to provide an editable UI for that YAML file. With a YAML editor, users can edit freely and notice…
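As an illustration only, a `model-definition.yml` that such an editor would surface might look like the following; every field name here is hypothetical, since the actual schema is project-specific:

```yaml
# Hypothetical model-definition.yml — field names are illustrative, not the real schema
name: sentiment-classifier
version: "1.0.0"
runtime: python3.11
entrypoint: app.serve:predict
resources:
  cpu: "2"
  memory: 4Gi
```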
-
**Describe the bug**
The [quickstart install](https://github.com/kserve/modelmesh-serving/blob/main/docs/quickstart.md#run-the-installation-script) instructions no longer work correctly. After depl…
-
Opening this issue to track the progress of models supported in candle-vllm.
-
We support lws as the default workload; however, in most cases multi-host deployment is not needed, even with Llama 3.1 405B. So maybe this is a better choice.
-
**Is your feature request related to a problem? Please describe.**
Extend the training parameters to allow flags, or a different CLI option, to be provided so that distributed training can be pe…
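One possible shape for such an option is sketched below; the command, flag names, and values are all hypothetical, not an existing interface:

```
# Hypothetical CLI sketch — flag names are illustrative only
trainer train --distributed --num-nodes 2 --gpus-per-node 8
```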