BerriAI / litellm

Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
https://docs.litellm.ai/docs/
Other
11.84k stars 1.37k forks source link

[Bug]: Azure OpenAI Embeddings via Cloudflare AI Gateway seem to have wrong #4626

Open Manouchehri opened 1 month ago

Manouchehri commented 1 month ago

What happened?

Looks like LiteLLM is sending YOUR_DEPLOYMENT_NAME twice in YOUR_RESOURCE_NAME/text-embedding-3-small/deployments/text-embedding-3-small/embeddings?api-version=2024-05-01-preview.

image

This is a recent regression I think. I had this config awhile ago and it was working fine.

Relevant log output

No response

Twitter / LinkedIn details

https://www.linkedin.com/in/davidmanouchehri/

Manouchehri commented 1 month ago

Oh hmm maybe not a regression, looking at the code I don't think this ever worked.