Open sibbl opened 1 year ago
A quick update: Microsoft now allows up to 16 items in the array, not only a single one.
OpenAI currently allows a larger number of array inputs with text-embedding-ada-002. Azure OpenAI currently supports input arrays up to 16 for text-embedding-ada-002 Version 2. Both require the max input token limit per API request to remain under 8191 for this model.
Feature Request
Azure OpenAI only allows one single string to be part of an embeddings request. Other frameworks have a `chunk_size` or `embed_batch_size` parameter for this.
Describe the solution you'd like
I'd propose an `int? ChunkSize = null` parameter for the `EmbeddingsRequest`. If it's `> 0`, then multiple requests should be made with `n` lines per request (`n` being the `ChunkSize` value).
Describe alternatives you've considered
I did the chunking myself, but as other frameworks have this built-in, we might also want to add such a parameter here.
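A minimal sketch of what such manual chunking could look like, written in Python for brevity (the same loop carries over to C#). The names `embed_in_chunks` and `embed_batch` are hypothetical: `embed_batch` stands in for a single embeddings API call and is not a function of this library or of the Azure OpenAI SDK. Treating `None` or a non-positive `chunk_size` as "send everything in one request" is an assumption based on the proposal above.

```python
from typing import Callable, List, Optional, Sequence

Vector = List[float]


def embed_in_chunks(
    lines: Sequence[str],
    embed_batch: Callable[[Sequence[str]], List[Vector]],
    chunk_size: Optional[int] = None,
) -> List[Vector]:
    """Embed `lines`, sending at most `chunk_size` inputs per request.

    `embed_batch` is a hypothetical stand-in for one embeddings API call;
    it must return one vector per input, in input order.
    """
    if not lines:
        return []

    # Assumption: no chunk size (or a non-positive one) means a single request.
    if chunk_size is None or chunk_size <= 0:
        return list(embed_batch(lines))

    vectors: List[Vector] = []
    for start in range(0, len(lines), chunk_size):
        # One request per slice; results are appended in input order.
        vectors.extend(embed_batch(lines[start:start + chunk_size]))
    return vectors
```

With the Azure limit of 16 array items mentioned in the update, a caller would pass `chunk_size=16`. The sketch does not check the per-request token limit, so that would still need separate handling.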
Additional context
Quote from MS docs about this limitation: