Open vitteloil opened 1 month ago
Okay, that's concerning and should not happen. There is no way to “autorecover”, e.g. if you run out of memory, and I assume that is the case here.
It is hard to guess the cause without more information about how you are using infinity via pip. Also check the usage instructions; I updated the tutorials recently. @vitteloil
System Info
Hi, I am trying to run infinity as the embeddings server for Dify. When a single POST to /embeddings fails with an error, the server stops processing further requests.
It seems https://github.com/michaelfeil/infinity/blob/main/libs/infinity_emb/infinity_emb/inference/batch_handler.py#L423 is not inside a try clause, which may be the root of this issue?
Running: infinity_emb v2 --model-id BAAI/bge-small-en-v1.5
Info: WSL2, Python 3.11.9, infinity_emb==0.0.39
Information
Tasks
Reproduction
Expected behavior
When a POST to /embeddings fails, I expect subsequent POSTs to still be processed.
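To illustrate the expected behavior, here is a minimal sketch of a queue worker that wraps each item in a try clause, so a failing request reports its error to the caller while the loop keeps serving later requests. This is not the actual infinity_emb batch handler: `worker`, `flaky_embed`, and `demo` are hypothetical names made up for this example, and the real code may be structured quite differently.

```python
import asyncio


async def worker(queue: asyncio.Queue, embed) -> None:
    """Process (payload, future) items forever; one failure must not end the loop."""
    while True:
        payload, future = await queue.get()
        try:
            result = embed(payload)
            if not future.done():
                future.set_result(result)
        except Exception as exc:
            # Hand the error back to the caller instead of letting it kill the worker.
            if not future.done():
                future.set_exception(exc)
        finally:
            queue.task_done()


def flaky_embed(text: str) -> list[float]:
    """Hypothetical stand-in for a model call; fails on empty input."""
    if not text:
        raise ValueError("empty input")
    return [float(len(text))]


async def demo() -> list:
    queue: asyncio.Queue = asyncio.Queue()
    task = asyncio.create_task(worker(queue, flaky_embed))
    loop = asyncio.get_running_loop()
    futures = []
    for text in ["hello", "", "hi"]:
        fut = loop.create_future()
        await queue.put((text, fut))
        futures.append(fut)
    # return_exceptions=True collects the failure alongside the successful results.
    results = await asyncio.gather(*futures, return_exceptions=True)
    task.cancel()
    return results


results = asyncio.run(demo())
```

In this sketch the second request fails, but the third is still answered, which is the behavior I would expect from the server. Note that a try clause like this only helps with errors raised as Python exceptions; it would not recover from the process being killed for running out of memory.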