michaelfeil / infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
https://michaelfeil.eu/infinity/
MIT License
977 stars 72 forks

Update docs based on feedback. #148

Closed michaelfeil closed 1 month ago

michaelfeil commented 3 months ago

User feedback: I would suggest the following:

cduk commented 3 months ago

Client code can be as simple as: curl -X POST -H 'Content-Type: application/json' -d '{"model": "bge-small-en-v1.5", "input":"A very large brown cat ate the pigeon."}' http://embeddinghost:7997/embeddings
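For readers who prefer Python over curl, the same request can be sketched with only the standard library. This is a minimal sketch, not code from the Infinity docs: the host, port, path, and model name are copied from the curl command above, while the helper names `build_embedding_request` and `fetch_embeddings` are hypothetical and the shape of the JSON response is not assumed.

```python
import json
import urllib.request

# URL and model name are taken from the curl command above; adjust to your deployment.
EMBEDDINGS_URL = "http://embeddinghost:7997/embeddings"
DEFAULT_MODEL = "bge-small-en-v1.5"


def build_embedding_request(text, model=DEFAULT_MODEL, url=EMBEDDINGS_URL):
    """Build (but do not send) the POST request the curl command would issue."""
    payload = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_embeddings(text, model=DEFAULT_MODEL, url=EMBEDDINGS_URL, timeout=30):
    """Send the request and return the decoded JSON body as-is."""
    request = build_embedding_request(text, model=model, url=url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (requires a running server at EMBEDDINGS_URL):
# result = fetch_embeddings("A very large brown cat ate the pigeon.")
```

Splitting request construction from sending keeps the payload easy to inspect without a running server.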

michaelfeil commented 3 months ago

@cduk Would it make sense to generate an example with the OpenAI client and an OpenAPI client?
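An OpenAI-client version might look roughly like the sketch below. It assumes the `openai` Python package (v1 or later) is installed and that the server at `http://embeddinghost:7997` returns an OpenAI-compatible embeddings response; neither is confirmed in this thread. The function name `embed_with_openai_client` and the placeholder API key are hypothetical.

```python
# Host, port, and model name are taken from the curl command earlier in the thread.
BASE_URL = "http://embeddinghost:7997"
MODEL = "bge-small-en-v1.5"


def embed_with_openai_client(texts, base_url=BASE_URL, model=MODEL):
    """Return one embedding vector per input string via the OpenAI client."""
    # Imported lazily so this module loads even when `openai` is not installed.
    from openai import OpenAI

    # The client requires some api_key value; "unused" is a placeholder and
    # assumes the server does not check it.
    client = OpenAI(base_url=base_url, api_key="unused")

    # The client posts to `<base_url>/embeddings`, matching the curl URL.
    response = client.embeddings.create(model=model, input=list(texts))
    return [item.embedding for item in response.data]


# Example call (requires `pip install openai` and a running server):
# vectors = embed_with_openai_client(["A very large brown cat ate the pigeon."])
```

Pointing `base_url` at the embedding host is the only change from a stock OpenAI setup, which is the main appeal of this route if the response format does turn out to be compatible.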