model-serving Search Results

1000+ results for model-serving

vllm-project/vllm #6781

[Performance]: Slow TTFT(?) for Qwen2-72B-GPTQ-Int4 on H100 …

I did some tests in order to find better parameter to speed up, and it appears that there hasn't been a significant change in TTFT (Time To First Token). Is my TTFT correct? I feel it might be a bit t…

cyc00518 updated 2 months ago
4 comments

tensorflow/serving #2222

Ragged Tensor as an output from Tensorflow serving

## Bug Report ### System information - **OS Platform and Distribution**: macOS - **TensorFlow Serving installed from**: binary - **TensorFlow version**: 2.14.1 ### Describe the problem We us…

bajaj6 updated 5 months ago
5 comments

dragonflyoss/Dragonfly2 #2177

Tensorflow Serving supports to download model with Dragonfly

### Feature request: - Tensorflow Serving: https://github.com/tensorflow/serving. ### Use case: ### UI Example:

gaius-qi updated 1 year ago
2 comments

kserve/kserve #3606

Support overriding model mount path in model server containe…

/kind feature **Describe the solution you'd like** Currently it is not possible to specify at what path the downloaded model should be available in the model server container. The downloaded model…

cmaddalozzo updated 3 months ago
3 comments

tensorflow/serving #2132

capping resources assigned to each model in multi model serv…

Is there a way to cap the number (e.g. CPU cores, CUDA MPS threads) of resources assigned to each model in a multi-model tensorflow server? The only way (straightforward way and not considering lower…

saeid93 updated 1 year ago
3 comments

vllm-project/vllm #3717

[Feature]: Distribute sets of default chat template for mode…

### 🚀 The feature, motivation and pitch Thanks to our amazing community, we have gathered a set of good chat template for models. These template are useful when the original model's `tokenizer_config…

simon-mo updated 2 weeks ago
1 comment

opendatahub-io/model-registry-bf4-kf #235

[model-controller] Setup e2e tests for model registry and se…

**Is your feature request related to a problem? Please describe.** In the [first implementation of model registry and serving implemention](https://github.com/opendatahub-io/model-registry/issues/173…

lampajr updated 11 months ago
1 comment

vllm-project/vllm #5587

[Installation]: `ModuleNotFoundError: No module named 'numpy…

### Anything you want to discuss about vllm. Users may see the following error when trying to run vllm: ``` Traceback (most recent call last): File "", line 198, in _run_module_as_main Fi…

glibg10b updated 1 week ago
5 comments

galeone/tfgo #58

Serving models with custom op using tfgo

Hi, I trained a model with custom op and export it using `saved_model` API and I would like to serve it using tfgo. However, since tfgo binds to tensorflow C library(more precisely `libtensorflow.so`)…

JiahuaWU updated 2 years ago
2 comments

kserve/modelmesh-serving #486

Models deployed with ModelMesh-Serving get restarted on upgr…

**Describe the bug** KServe community follow an approach to release all repos together irrespective if there are code changes in independent repos or not. **For example**, In release `v0.11.2`, a …

vaibhavjainwiz updated 9 months ago
3 comments
