serving-tensors Search Results

1000+ results
for serving-tensors


mlflow/mlflow #2830

[BUG] Predict N-dimensional data with pyfunc from MLmodel

### Willingness to contribute The MLflow Community encourages bug fix contributions. Would you or another member of your organization be willing to contribute a fix for this bug to the MLflow code ba…

aseltmann updated 4 years ago
2
flashinfer-ai/flashinfer #391

Can we integrate Flashinfer into gpt-fast?

Hi, in previous issues, you wrote that you planned to integrate flashinfer into some inference backend like gpt-fast. This will be very interesting! And may I ask can we integrate Flashinfer into gpt-f…

jianc99 updated 2 months ago
9
ollama/ollama #5826

Azurefile (NFS) causes very slow model loads - Mixtral 22B i…

### What is the issue? Trying to load Mixtral 8x22B model using an A100 GPU as a deployment in Kubernetes, but it isn't loaded after 6 minutes. Mistral 7B model is loaded fine. Here is the de…

juangon updated 3 months ago
10
tensorflow/recommenders-addons #365

Support `MultiWorkerMirroredStrategy` distributed training s…

I tried to explore available approaches for distributed training of large-scale recommendation models with huge embedding tables and tried to use TFRA `DynamicEmbedding` combined with `MultiWorkerMirr…

sivukhin updated 10 months ago
10
vitalets/github-trending-repos #7

New daily trending repos in Python

Subscribe to this issue and stay notified about new [daily trending repos in Python](https://github.com/trending/python?since=daily)!

vitalets updated 11 hours ago
31
onnx/tensorflow-onnx #1064

Cannot convert from saved_model

**Describe the bug** I got an error trying to convert from a saved_model.pb built with tensorflow 2.3.0: `Traceback (most recent call last): File "C:\Program Files\Python38\lib\runpy.py", lin…

Fax3D updated 1 month ago
7
KomputeProject/kompute #52

Explore / discuss for potential ideas or improvements

Open issue to openly discuss potential ideas or improvements, whether on documentation, interfaces, examples, bug fixes, etc.

axsaucedo updated 3 years ago
26
vllm-project/vllm #6468

[Performance]: [Speculative Decoding] Measurement of Cost Co…

### Proposal to improve performance Recently, vLLM has been conducting a lot of work related to Speculative Decoding, and we often see remarkable achievements. For the Speculative Decoding algorit…

bong-furiosa updated 3 months ago
5
pytorch/serve #2770

Backend process failed

### 🐛 Describe the bug Command I have used to create model file `torch-model-archiver --model-name yolo_tiny --version 1.0 --model-file model.pth --handler handler.py torchserve --start --m…

naveenjr updated 11 months ago
3
triton-inference-server/server #6647

Tensorflow models: Add support to specify multiple signature…

First of all, thank you very much for all effort put into this project. From what I have seen in the past couple of weeks investigating it I am really impressed by the state and performance of it! …

NiklasA11 updated 6 months ago
19
