Open: dongxiaolong opened this issue 7 months ago
What do you need exactly for it to be supported? Is supporting the embeddings per token with compression enough?
I appreciate your response. I believe it would be better to support both the ColBERT embedding model and the reranker model, since these are two distinct applications. As an embedding model in LlamaIndex and LangChain: end-to-end retrieval, RAGatouille. As a reranker model, described in this blog post: ColBERT as a re-ranker.
ColBERT is very interesting. I want to try it on HF Text Embeddings Inference.
Model description
ColBERT is a fast and accurate retrieval model, enabling scalable BERT-based search over large text collections in tens of milliseconds. Links: ColBERT GitHub, ColBERT Hugging Face.
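For readers unfamiliar with the model: ColBERT keeps one embedding per token and scores a query against a document by "late interaction" (for each query token, take the maximum similarity to any document token, then sum). Below is a minimal sketch of that scoring and of using it to rerank candidates. It assumes the per-token embeddings have already been produced by some encoder; the function names `maxsim_score` and `rerank` are hypothetical and are not part of any library mentioned in this thread.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def maxsim_score(query_tokens, doc_tokens):
    """Late-interaction score: sum over query tokens of the best-matching doc token.

    Both arguments are non-empty lists of per-token embedding vectors
    (assumed to come from an encoder that is not shown here).
    """
    q = [normalize(v) for v in query_tokens]
    d = [normalize(v) for v in doc_tokens]
    return sum(
        max(sum(a * b for a, b in zip(qv, dv)) for dv in d)
        for qv in q
    )


def rerank(query_tokens, candidates):
    """Sort candidate documents (doc_id -> token embeddings) by descending score."""
    scored = [
        (doc_id, maxsim_score(query_tokens, toks))
        for doc_id, toks in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 2-dimensional embeddings, for illustration only.
query = [[1.0, 0.0], [0.0, 1.0]]
candidates = {
    "doc_a": [[1.0, 0.0], [0.0, 1.0]],  # matches both query tokens
    "doc_b": [[1.0, 0.0]],              # matches only the first query token
}
ranking = rerank(query, candidates)
```

This is why the two applications requested above differ in what a server would need to return: the embedding use case needs the per-token vectors themselves, while the reranker use case only needs the final score per (query, document) pair.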
Open source status
Provide useful links for the implementation
No response