vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://docs.vllm.ai
Apache License 2.0
30.79k stars, 4.68k forks

[New Model]: Snowflake Arctic Embed (Family) #7792

Open: K-Mistele opened this issue 3 months ago

K-Mistele commented 3 months ago

The model to consider

The Snowflake Arctic Embed family of embedding models: a series of very small models that perform well for their size.

https://huggingface.co/Snowflake/snowflake-arctic-embed-xs
https://huggingface.co/Snowflake/snowflake-arctic-embed-s
https://huggingface.co/Snowflake/snowflake-arctic-embed-m
https://huggingface.co/Snowflake/snowflake-arctic-embed-m-long
https://huggingface.co/Snowflake/snowflake-arctic-embed-l

The closest model vllm already supports.

Closest supported embedding model: https://huggingface.co/intfloat/e5-mistral-7b-instruct

Closest model by family: https://huggingface.co/Snowflake/snowflake-arctic-instruct

What's your difficulty of supporting the model you want?

Only Mistral-based embedding models are supported at this time.

github-actions[bot] commented 3 days ago

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!