vitrivr's next-generation retrieval engine. It is capable of extracting and retrieving a wider range of multimedia objects such as audio, video, images or 3d models.
Some newer embedding models such as https://huggingface.co/intfloat/e5-mistral-7b-instruct require a one-sentence instruction that describes the retrieval task in addition to the content that should be embedded. This model in particular is currently implemented in the FES and accessible through the ApiWrapper. It might be useful to extend the DenseEmbedding analyser to support these models as well. To accomplish this the DenseEmbedding analyser (or its methods) needs the task instruction (e.g. 'Given a web search query, retrieve relevant passages that answer the query') as a parameter. I would like to ask for your feedback if this should be configured as part of a query or as part of the field. My intuition is that it is a parameter of the query and thus should be passed as query context to newRetrieverForContent... Any thoughts?
Some newer embedding models such as https://huggingface.co/intfloat/e5-mistral-7b-instruct require a one-sentence instruction that describes the retrieval task in addition to the content that should be embedded. This model in particular is currently implemented in the FES and accessible through the ApiWrapper. It might be useful to extend the DenseEmbedding analyser to support these models as well. To accomplish this the DenseEmbedding analyser (or its methods) needs the task instruction (e.g. 'Given a web search query, retrieve relevant passages that answer the query') as a parameter. I would like to ask for your feedback if this should be configured as part of a query or as part of the field. My intuition is that it is a parameter of the query and thus should be passed as query context to newRetrieverForContent... Any thoughts?