Open ziozzang opened 3 months ago
Just ran into this issue as well. It would be great if we could support this new model.
Also the embedding model: https://huggingface.co/BAAI/bge-multilingual-gemma2
Chiming in to express interest as well. This model is great!
It might require adding a few extra runtime args for layer cutoff, compression, etc. Also maybe the ability to choose prompts at inference time?
You can use bge-reranker-v2.5-gemma2-lightweight with the following different prompts:
- Predict whether passage B contains an answer to query A.
- Predict whether passages A and B have the same meaning.
- Predict whether queries A and B are asking the same thing.
- Predict whether argument A and counterargument B express contradictory opinions.
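To make the request concrete, here is a minimal sketch of what choosing a prompt per request could look like. The prompt strings are the ones listed above; everything else is an assumption for illustration only: the `RerankOptions` name, its fields, the task keys, and the `A: ... / B: ...` input layout are hypothetical and not taken from any existing server API.

```python
from dataclasses import dataclass
from typing import Optional

# Prompt strings quoted from the list above.
# The dictionary keys are hypothetical labels, not official task names.
PROMPTS = {
    "qa": "Predict whether passage B contains an answer to query A.",
    "paraphrase": "Predict whether passages A and B have the same meaning.",
    "duplicate_query": "Predict whether queries A and B are asking the same thing.",
    "contradiction": (
        "Predict whether argument A and counterargument B "
        "express contradictory opinions."
    ),
}


@dataclass
class RerankOptions:
    """Hypothetical per-request options (names are assumptions)."""

    task: str = "qa"
    # Placeholders for the extra runtime args mentioned above; how a
    # server would interpret them is left open in this sketch.
    cutoff_layer: Optional[int] = None
    compress_ratio: Optional[int] = None


def build_input(a: str, b: str, options: RerankOptions) -> str:
    """Combine texts A and B with the prompt selected for this request.

    The "A: ... / B: ..." layout is an assumed format for illustration.
    """
    if options.task not in PROMPTS:
        raise ValueError(f"unknown task: {options.task!r}")
    return f"A: {a}\nB: {b}\n{PROMPTS[options.task]}"


if __name__ == "__main__":
    opts = RerankOptions(task="duplicate_query")
    print(build_input("how do I sort a list?", "sorting a list in python", opts))
```

Keeping the prompt choice in a per-request options object, rather than a startup flag, would let one loaded model serve all four tasks.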
Really cool stuff, hoping this can get picked up!
Model description
bge-reranker-v2.5-gemma2-lightweight's performance is better than bge-m3 :)
Please support this model.
Open source status
Provide useful links for the implementation
https://huggingface.co/BAAI/bge-reranker-v2.5-gemma2-lightweight