Preemo-Inc / gradientai-python-sdk

Interface for interacting with Gradient AI in Python
14 stars 2 forks source link

Enchancements to Embedding: async and mini-batching or for large vector stores #11

Closed michaelfeil closed 9 months ago

michaelfeil commented 1 year ago

As embedding models are often used in conjunction with RAG, following functionalities would be helpful

I implemented a minimal client which supports the above features https://gist.github.com/michaelfeil/8c6829158e7dd6bb5fdf805b5ea2978c

If async is not on the roadmap, I would at least suggest to split documents in sorted order and send them with a ThreadPool