-
### The Feature
To support custom input params for Triton embedding server.
### Motivation, pitch
Currently the input payload params of the Triton Embedding model call is fixed with below for…
-
We have two folder `model` and `models` they are used for storing embedding models and llms respectively.
Use one folder for both instead (models)
To complete this feature you will need to look fo…
-
```
field required [type=missing, input_value={'name': 'NPOS', 'type': 'Communication'}, input_type=dict]
For further information visit https://errors.pydantic.dev/2.9/v/missing
```
Here is co…
-
**Is your feature request related to a problem? Please describe.**
Embedding models typically have smaller context windows than LLMs, which can limit the quality of embeddings generated for large con…
nkkko updated
1 month ago
-
In this repo the Llama3 tokenizer sets the `` special token to `128011` https://github.com/meta-llama/llama-models/blob/ec6b56330258f6c544a6ca95c52a2aee09d8e3ca/models/llama3/api/tokenizer.py#L79-L101…
-
Hi,
Two models that were uploaded recently (https://huggingface.co/minishlab/M2V_base_glove and https://huggingface.co/minishlab/M2V_base_output) do not have Model Size and Embedding Dimensions on …
-
### Describe your problem
I'm unable to load this image that I built without embedding models on my macbook.
docker image ls
REPOSITORY …
-
In our doc guide for [MLModel -> Updatable Models -> Pipeline Classifier](https://apple.github.io/coremltools/docs-guides/source/updatable-tiny-drawing-classifier-pipeline-model.html#get-the-embedding…
-
### Description
Recently, I have been using Graphrag to index patent documents. The embedding model used is BGE-M3, and the document is divided into paragraphs with some additional segmentation rul…
-
### Describe your problem
First of all, this is a great project and I would like to thank you for your effort on this project. 😆
I would like to share a knowledge base along with it's embedding …