serving-tensors Search Results

1000+ results
for serving-tensors

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

irthomasthomas/undecidability #682

Codefuse-ChatBot: Development by Private Knowledge Augmentat…

- [ ] [codefuse-chatbot/README_en.md at main · codefuse-ai/codefuse-chatbot](https://github.com/codefuse-ai/codefuse-chatbot/blob/main/README_en.md?plain=1) # codefuse-chatbot/README_en.md at main ·…

irthomasthomas updated 2 months ago
2
uber/neuropod #485

Is nueropod designed to support tf.Example or sparse tensor …

It's actually two separate questions: 1. Is nueropod designed to support tf.Example? From the [material](https://eng.uber.com/introducing-neuropod/) I found, seems nueropod's design goal is: as lo…

helinwang updated 3 years ago
3
mlflow/mlflow #3570

[BUG] Unable to invoke models tracked by MLFlow that require…

Thank you for submitting an issue. Please refer to our [issue policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md) for additional information about bug reports. For help with debu…

mohammedi-haroune updated 3 years ago
6
ddps-lab/tfserving-inference #47

tf serving 코드 분석하기

tf model server가 요청을 어떻게 처리하는지 파악하기위해 코드를 분석할 필요가 있다. 서버코드는 https://github.com/tensorflow/serving/tree/master/tensorflow_serving/model_servers 여기에 있고 관련된 추가 코드는 https://github.com/tensorflow/tensorfl…

kh3654po updated 1 year ago
16
vllm-project/vllm #5600

[Feature]: support Qwen2 embedding

### 🚀 The feature, motivation and pitch in the Mteb leaderboard, the current best embedding model is `Alibaba-NLP/gte-Qwen2-7B-instruct`. However, using the embedding endpoint on it returns the foll…

DavidPeleg6 updated 2 months ago
4
tensorlayer/seq2seq-chatbot #27

tensorflow server signature

I am trying to serve the model over tensorflow serving and I have created the below signature. But it doesnt seem to work. Please help me @pskrunner14 encode_seqs = tf.placeholder(dtype=tf.int64, …

nidhikamath91 updated 5 years ago
17
onnx/onnx-tensorrt #917

Assertion fail in fillShapeVector when using tf.image.crop_a…

## Description My attempts at performing an inference for a Faster-RCNN model lead to a segmentation fault of Python. The problem seems related to the `tf.image.crop_and_resize` operation. I can re…

vdel updated 1 year ago
2
Dao-AILab/flash-attention #658

Memory usage can't stop increasing

Hello, I found memory usage can't stop increasing when serving Qwen model. I'm using flash-attention==2.3.3 When I run the code below, the memory growth from 3.1g to 3.5g, and would continue growi…

miangangzhen updated 11 months ago
3
tensorflow/model-analysis #45

TFMA and TF KERAS 2.0 model on pretrained model

Hello all, I am referring here to stackoverflow that I have published couple of days ago: [https://stackoverflow.com/questions/56248024/tensorflow-model-analysis-tfma-for-keras-model] I didn't rec…

OrielResearchCure updated 3 years ago
7
triton-inference-server/server #5259

Suggestion to reduce RAM consumption

**Is your feature request related to a problem? Please describe.** So I'm trying to use tritonserver in my project. But it uses a lot of RAM for a single model. * Is this expected behaviour? * Ar…

oleks-popovych updated 1 year ago
12

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for serving-tensors

1000+ results
for serving-tensors