michaelfeil / infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
https://michaelfeil.eu/infinity/
MIT License
975 stars 72 forks source link

Love the repo! Wish I could help! #157

Open BBC-Esq opened 3 months ago

michaelfeil commented 3 months ago

Plenty of options. Love your contributions and notes to CTranslate2 - perhaps its time for you to open more PRs.

Some of them could be easier than others:

Docs:

I recently added https://michaelfeil.eu/infinity as docs page.

External

Llamaindex or Haystack

Contribute outside of this repo, e.g. llamaindex or haystack are cool repos. #111 I am happy to review code there if you end up writing some.

Vector DBs:

Had some chat with people running vector dbs - if you have a favorite one and end up using

Showcase on Reddit / Linkedin / Medium / other mediums

Could help others to get started - potentially more useful than a 5% performance improvement. That could be cool to add.

HELP-NEEDED tickets

Tried to add some help needed tickets - pick what you want!

Benchmarking on H100

If you happen to have one haha

Jimmy-Newtron commented 1 month ago

You could rent an H100 GPU for few $/hour on Vast AI

image

SuperSecureHuman commented 1 week ago

I have some A100s, if you want some numbers from there