Closed · michaelfeil closed this issue 2 months ago
Got some suggestions for frameworks for self-hosted serving of LLMs and related models.

Embeddings from OpenAI CLIP:
Jina clip-as-service https://github.com/jina-ai/clip-as-service (Apache-2.0)

Text embeddings:
My own project: infinity (just add it if you like it) https://github.com/michaelfeil/infinity (MIT)
https://github.com/huggingface/text-embeddings-inference (no open-source license)

LLM inference as a service:
Hugging Face TGI https://github.com/huggingface/text-generation-inference (no longer an open-source license after 1.0)
NVIDIA TensorRT-LLM (Apache-2.0)
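Several of the servers above (infinity, text-embeddings-inference) expose an OpenAI-compatible `/embeddings` REST endpoint, so clients can be written once against that schema. A minimal stdlib-only sketch, assuming a local server at `http://localhost:7997/embeddings` and the model name `BAAI/bge-small-en-v1.5` (both are illustrative placeholders, not taken from the thread):

```python
import json

def build_embedding_request(texts, model="BAAI/bge-small-en-v1.5"):
    """Build an OpenAI-style /embeddings request body (model name is an example)."""
    return json.dumps({"model": model, "input": texts}).encode()

def parse_embedding_response(raw):
    """Extract vectors from an OpenAI-style response: {"data": [{"embedding": [...]}, ...]}."""
    payload = json.loads(raw)
    return [item["embedding"] for item in payload["data"]]

# Usage against a running local server (commented out; requires a live instance):
# import urllib.request
# req = urllib.request.Request(
#     "http://localhost:7997/embeddings",
#     data=build_embedding_request(["hello world"]),
#     headers={"Content-Type": "application/json"},
# )
# vectors = parse_embedding_response(urllib.request.urlopen(req).read())
```

Because the wire format is shared, swapping one backend for another is mostly a matter of changing the URL and model name.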
Thanks @michaelfeil! Will add soon. There are some other papers I noticed; I'll add those over the weekend.
Merged. Thanks @michaelfeil!