issues
search
michaelfeil
/
infinity
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
https://michaelfeil.eu/infinity/
MIT License
971
stars
72
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
update default model_name to be unified name across routes
#179
michaelfeil
closed
3 months ago
4
model name is not consistent across endpoints
#178
bufferoverflow
closed
3 months ago
1
update lock
#177
michaelfeil
closed
3 months ago
1
update typing
#176
michaelfeil
closed
3 months ago
1
redirect to `/docs` and optional imports
#175
michaelfeil
closed
3 months ago
1
refactor `ENUM..TypeHint` into a function
#172
michaelfeil
closed
3 months ago
1
refactored more imports
#171
michaelfeil
closed
3 months ago
1
Embedding quant
#170
michaelfeil
closed
3 months ago
1
bump sentence transformers to v.2.6.0
#169
michaelfeil
closed
3 months ago
0
Create ISSUE_TEMPLATE
#168
michaelfeil
closed
3 months ago
0
Update README.md
#167
sherwin684
closed
3 months ago
0
Does this work with re-rankers?
#165
cduk
closed
3 months ago
0
Safetensors or to be sure not to load pickled weights
#164
wllhf
closed
3 months ago
3
The embeddings are random When use multithreading requests
#163
xuwei6
closed
3 months ago
5
How to run or access infinity on hf a space?
#161
ffreemt
closed
3 months ago
1
float16 and other optimizations help?
#159
BBC-Esq
opened
3 months ago
6
benchmarks?
#158
BBC-Esq
closed
3 months ago
1
Love the repo! Wish I could help!
#157
BBC-Esq
opened
3 months ago
3
Some docstring and typing fixes
#156
lckr
closed
3 months ago
2
Move `.detach().cpu()` into `encode_core`, and option to use cuda streams
#155
jobright-jiyuan
opened
3 months ago
5
add async tokenization to reranker in torch
#154
michaelfeil
closed
3 months ago
1
Fp8 support
#153
michaelfeil
closed
3 months ago
1
Add bettertransformer to cli
#152
michaelfeil
closed
3 months ago
1
Dynamic loading - different models at request time / multiple models
#151
cduk
opened
3 months ago
4
[Docs] Add quantization / dtype doc
#150
michaelfeil
closed
1 month ago
0
Issue templates
#149
michaelfeil
closed
3 months ago
1
Update docs based on feeback.
#148
michaelfeil
closed
1 month ago
2
Multi-Modal Inference / Clip
#147
michaelfeil
closed
3 weeks ago
5
Question: Support for sparse embeddings?
#146
Matheus-Garbelini
opened
3 months ago
4
Update README.md
#145
michaelfeil
closed
3 months ago
1
update poetry lock - sentence-transformers 2.5.0
#144
michaelfeil
closed
3 months ago
1
Revert "Sentence transformers bump to 2.5.0"
#143
michaelfeil
closed
3 months ago
0
Sentence transformers bump to 2.5.0
#142
michaelfeil
closed
3 months ago
1
remove fastembed
#141
michaelfeil
closed
3 months ago
1
OpenAI models and update docs and
#140
michaelfeil
closed
3 months ago
1
shrink: docker image size by pruning venv
#139
peebles
closed
3 months ago
6
Adding mkdocs url
#138
michaelfeil
closed
3 months ago
0
add docs via mkdocs
#137
michaelfeil
closed
3 months ago
1
Content-Encoding: gzip
#136
andrew-at-rise
opened
3 months ago
7
add michaelfeil/bge-small as default model
#135
michaelfeil
closed
3 months ago
1
Quantization: int8
#134
michaelfeil
closed
3 months ago
1
add macos ci
#133
michaelfeil
closed
3 months ago
1
Ci multi os windows
#132
michaelfeil
closed
3 months ago
1
multiple os ci / python 3.12
#131
michaelfeil
closed
3 months ago
1
"msg":"Input should be a valid list"
#130
fishfree
closed
3 months ago
6
Reranker model fails to load (maidalun1020/bce-reranker-base_v1) - no max token length is set
#127
Matheus-Garbelini
closed
4 months ago
4
infinity_emb failed at startup using `torch.compile` when installed via pip
#126
beebopkim
closed
1 month ago
9
Support for instructur/instructor-xl models
#125
BBC-Esq
opened
4 months ago
9
expand: EngineArgs
#124
michaelfeil
closed
3 months ago
1
Support for nomic-ai/nomic-embed-text-v1.5
#123
SupreethRao99
closed
4 months ago
1
Previous
Next