-
- [ ] [codefuse-chatbot/README_en.md at main · codefuse-ai/codefuse-chatbot](https://github.com/codefuse-ai/codefuse-chatbot/blob/main/README_en.md?plain=1)
# codefuse-chatbot/README_en.md at main ·…
-
It's actually two separate questions:
1. Is nueropod designed to support tf.Example?
From the [material](https://eng.uber.com/introducing-neuropod/) I found, seems nueropod's design goal is: as lo…
-
Thank you for submitting an issue. Please refer to our [issue policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md) for additional information about bug reports. For help with debu…
-
tf model server가 요청을 어떻게 처리하는지 파악하기위해 코드를 분석할 필요가 있다.
서버코드는 https://github.com/tensorflow/serving/tree/master/tensorflow_serving/model_servers 여기에 있고 관련된 추가 코드는 https://github.com/tensorflow/tensorfl…
-
### 🚀 The feature, motivation and pitch
in the Mteb leaderboard, the current best embedding model is `Alibaba-NLP/gte-Qwen2-7B-instruct`.
However, using the embedding endpoint on it returns the foll…
-
I am trying to serve the model over tensorflow serving and I have created the below signature. But it doesnt seem to work. Please help me @pskrunner14
encode_seqs = tf.placeholder(dtype=tf.int64, …
-
## Description
My attempts at performing an inference for a Faster-RCNN model lead to a segmentation fault of Python. The problem seems related to the `tf.image.crop_and_resize` operation. I can re…
-
Hello, I found memory usage can't stop increasing when serving Qwen model.
I'm using flash-attention==2.3.3
When I run the code below, the memory growth from 3.1g to 3.5g, and would continue growi…
-
Hello all,
I am referring here to stackoverflow that I have published couple of days ago: [https://stackoverflow.com/questions/56248024/tensorflow-model-analysis-tfma-for-keras-model]
I didn't rec…
-
**Is your feature request related to a problem? Please describe.**
So I'm trying to use tritonserver in my project. But it uses a lot of RAM for a single model.
* Is this expected behaviour?
* Ar…