michaelfeil/infinity
Infinity is a high-throughput, low-latency REST API for serving text embeddings, reranking models, and CLIP
https://michaelfeil.github.io/infinity/
MIT License
1.32k stars
97 forks
issues
Update README.md
#167
sherwin684
closed
6 months ago
0
Does this work with re-rankers?
#165
cduk
closed
6 months ago
0
Safetensors or to be sure not to load pickled weights
#164
wllhf
closed
6 months ago
3
The embeddings are random when using multithreaded requests
#163
xuwei6
closed
6 months ago
5
How to run or access infinity on hf a space?
#161
ffreemt
closed
6 months ago
1
float16 and other optimizations help?
#159
BBC-Esq
opened
6 months ago
6
benchmarks?
#158
BBC-Esq
closed
6 months ago
1
Love the repo! Wish I could help!
#157
BBC-Esq
opened
6 months ago
3
Some docstring and typing fixes
#156
lckr
closed
6 months ago
2
Move `.detach().cpu()` into `encode_core`, and option to use cuda streams
#155
jobright-jiyuan
opened
6 months ago
5
add async tokenization to reranker in torch
#154
michaelfeil
closed
6 months ago
1
Fp8 support
#153
michaelfeil
closed
6 months ago
1
Add bettertransformer to cli
#152
michaelfeil
closed
6 months ago
1
Dynamic loading - different models at request time / multiple models
#151
cduk
closed
2 months ago
5
[Docs] Add quantization / dtype doc
#150
michaelfeil
closed
4 months ago
0
Issue templates
#149
michaelfeil
closed
6 months ago
1
Update docs based on feedback.
#148
michaelfeil
closed
4 months ago
2
Multi-Modal Inference / Clip
#147
michaelfeil
closed
3 months ago
5
Question: Support for sparse embeddings?
#146
Matheus-Garbelini
opened
6 months ago
5
Update README.md
#145
michaelfeil
closed
6 months ago
1
update poetry lock - sentence-transformers 2.5.0
#144
michaelfeil
closed
6 months ago
1
Revert "Sentence transformers bump to 2.5.0"
#143
michaelfeil
closed
6 months ago
0
Sentence transformers bump to 2.5.0
#142
michaelfeil
closed
6 months ago
1
remove fastembed
#141
michaelfeil
closed
6 months ago
1
OpenAI models and update docs and
#140
michaelfeil
closed
6 months ago
1
shrink: docker image size by pruning venv
#139
peebles
closed
6 months ago
6
Adding mkdocs url
#138
michaelfeil
closed
6 months ago
0
add docs via mkdocs
#137
michaelfeil
closed
6 months ago
1
Content-Encoding: gzip
#136
andrew-at-rise
closed
1 month ago
7
add michaelfeil/bge-small as default model
#135
michaelfeil
closed
6 months ago
1
Quantization: int8
#134
michaelfeil
closed
6 months ago
1
add macos ci
#133
michaelfeil
closed
6 months ago
1
Ci multi os windows
#132
michaelfeil
closed
6 months ago
1
multiple os ci / python 3.12
#131
michaelfeil
closed
6 months ago
1
"msg":"Input should be a valid list"
#130
fishfree
closed
7 months ago
6
Reranker model fails to load (maidalun1020/bce-reranker-base_v1) - no max token length is set
#127
Matheus-Garbelini
closed
7 months ago
4
infinity_emb failed at startup using `torch.compile` when installed via pip
#126
beebopkim
closed
4 months ago
9
Support for instructur/instructor-xl models
#125
BBC-Esq
opened
7 months ago
9
expand: EngineArgs
#124
michaelfeil
closed
6 months ago
1
Support for nomic-ai/nomic-embed-text-v1.5
#123
SupreethRao99
closed
7 months ago
1
Adding torch.compile + fp16 + bettertransformer a CLI argument
#122
michaelfeil
closed
6 months ago
0
Asking to truncate to max_length but no maximum length
#121
semoal
closed
7 months ago
1
Support for Inferentia2 (draft)
#118
michaelfeil
closed
6 months ago
1
Optimum windows fix
#117
michaelfeil
closed
7 months ago
1
Torch + Cuda + Bert crashes abruptly on startup
#115
semoal
closed
6 months ago
10
Parity break with OpenAI API: /models
#114
MichaelMcCulloch
closed
6 months ago
4
bump st
#113
michaelfeil
closed
7 months ago
1
update hf_transfer improvement
#112
michaelfeil
closed
7 months ago
1
Create llama-index `InfinityEmbeddings` as langchain
#111
semoal
opened
7 months ago
12
Benchmarking
#110
michaelfeil
closed
7 months ago
1