Preemo-Inc / gradientai-python-sdk

Interface for interacting with Gradient AI in Python
13 stars 2 forks source link

Enchancements to Embeddings: latency optimized/ debugging local model #12

Open michaelfeil opened 9 months ago

michaelfeil commented 9 months ago

A bit of a creative idea. Likely to be an interesting business concept, but at least a unique selling point. No other Embedding provider offers this, apart from a hacky do-it-yourself version of huggingface.

For batch-size one:

I would suggest to add a base / not fine-tuned encoder model (bge-large) with a SentenceTransformers like setup (ONNX-cpu or CTranslate2-cpu, which do not require torch). Users could then switch between local mode and API mode.

pip install gradientai[local-embedder]