issues
search
michaelfeil
/
infinity
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
https://michaelfeil.eu/infinity/
MIT License
959
stars
71
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add auth and classify endpoint to openai server
#237
Rololi
closed
1 month ago
1
deberta support
#235
Stealthwriter
closed
1 month ago
1
update poetry lock to latest
#234
michaelfeil
closed
1 month ago
0
Add einops as extra dependency
#233
chiragjn
closed
1 month ago
2
bump sentence-transformers to 3.0
#232
michaelfeil
closed
1 month ago
0
Include `einops` in docker image
#231
chiragjn
closed
1 month ago
2
add infinity server kwargs for device
#230
michaelfeil
closed
1 month ago
1
update docs: v2 cli and async request handling
#229
michaelfeil
closed
1 month ago
0
add v2 to CLI, launching multiple models
#227
michaelfeil
closed
1 month ago
0
API Token
#226
vladimirmujagic
closed
1 month ago
1
ValueError: No onnx files found
#225
netw0rkf10w
opened
1 month ago
6
Add revision and trust_remote_code to from_pretrained calls
#224
chiragjn
closed
1 month ago
2
Docker path not in readme
#222
hughesadam87
closed
1 month ago
3
update readme for docker run command
#221
michaelfeil
closed
1 month ago
0
The jinaai/jina-embeddings-v2-base-zh model reports an error when importing documents into RAG.
#220
edisonzf2020
closed
1 month ago
1
mxbai-rerank-large-v1 starup error
#219
edisonzf2020
closed
1 month ago
2
BAAI/bge-reranker-base startup error
#218
andrew-at-rise
closed
1 month ago
9
Loading models from local path
#217
vladimirmujagic
closed
1 month ago
2
docker compose with folder with models
#215
shuther
closed
1 month ago
1
Add option to enable permissive CORS headers to allow api access from…
#214
kir-gadjello
closed
1 month ago
3
Tensor-parallelism for multi-gpu support
#213
SalomonKisters
opened
2 months ago
1
0.0.33 bump
#212
michaelfeil
closed
2 months ago
0
Add fp32 as runtime dtype
#211
michaelfeil
closed
2 months ago
0
bump sentence-transformers and torch
#210
michaelfeil
closed
1 month ago
0
Support for Python 3.8 in infinity
#209
BarryRun
closed
2 months ago
1
Different results with mixedbread-ai/mxbai-embed-large-v1 model
#208
stephenleo
closed
1 month ago
3
API Key Authentication for Michaelfeil Infinity
#207
AjayKarma05
closed
1 month ago
8
Hanging after first embedding generated on MPS
#206
semoal
closed
1 month ago
3
Load local model
#204
jmoney
closed
2 months ago
1
Scores slightly off/get rounded up to 1.0
#203
ruben-vb
closed
1 month ago
6
refactor `BatchHandler` into `ModelWorker`
#202
michaelfeil
closed
2 months ago
1
fix-orjson
#201
michaelfeil
closed
2 months ago
0
Add `EngineArray` Multi-Model [1/3]
#200
michaelfeil
closed
2 months ago
1
Openapi tests
#199
michaelfeil
closed
2 months ago
0
Missing library for nomic-ai/nomic-embed-text-v1.5 model
#197
shubham-bnxt
closed
2 months ago
2
update offline-mode: deployment docs v2
#196
michaelfeil
closed
2 months ago
0
update infinity offline solution
#195
michaelfeil
closed
2 months ago
1
HF_HOME not respected
#194
WinsonSou
closed
2 months ago
6
Add a TextSplitter in LangChain to share the model of the embedding model
#193
Jimmy-Newtron
opened
2 months ago
0
japanese-reranker-cross-encoder-large-v1 does not work with CLI
#192
nassie256
closed
2 months ago
4
API-design: fix-astop
#190
michaelfeil
closed
1 month ago
0
Update README.md - add Contributors
#189
michaelfeil
closed
3 months ago
1
update defered moving to cpu & type hints improvement
#187
michaelfeil
closed
3 months ago
1
Customize Docker
#186
ibrahimfw
closed
3 months ago
1
Error in offline mode with `trust_remote code`: SFR-Embedding-Mistral and nomic does not work without `einops`
#185
prasannakrish97
closed
1 month ago
4
Update bug-report.yml
#184
michaelfeil
closed
3 months ago
0
pydantic cli / args validation
#183
michaelfeil
closed
3 months ago
1
python39 type hints
#182
michaelfeil
closed
3 months ago
1
FIX: import crossencoder without torch installed and git push of creds
#181
michaelfeil
closed
3 months ago
1
feat: add served_model_name argument for the infinity_server
#180
bufferoverflow
closed
3 months ago
3
Previous
Next