Closed charlesdedampierre closed 7 months ago
Hello!
This would be quite unusual. Mistral-7B is a great model, but it's a decoder (i.e. a model that produces text), not an encoder (i.e. a model that produces embeddings). A more common scenario would be to take a Sentence Transformer (encoder) model trained on sentence-similarity, use it with some embedding store/vector database to choose texts relevant to some query, and then feed those texts to the Mistral decoder. This is a standard Retrieval Augmented Generation setup that can be implemented using various different open source projects.
Hi,
Is it possible to fine-tune a model like mistralai/Mistral-7B-v0.1 (https://huggingface.co/mistralai/Mistral-7B-v0.1) on sentence-similarity task ? Any ressources ?
Thanks!
Charles