michaelfeil / infinity

Infinity is a high-throughput, low-latency REST API for serving text embeddings, reranking models, and CLIP
https://michaelfeil.github.io/infinity/
MIT License
1.32k stars, 97 forks

Support for Inferentia2 (draft) #118

Closed: michaelfeil closed this 6 months ago

codecov-commenter commented 6 months ago

Codecov Report

Attention: Patch coverage is 35.89744%, with 50 lines in your changes missing coverage. Please review.

Project coverage is 72.67%. Comparing base (9f41ec4) to head (86f6461). Report is 4 commits behind head on main.

| Files | Patch % | Lines |
|---|---|---|
| ...ty_emb/infinity_emb/transformer/embedder/neuron.py | 34.32% | 44 Missing :warning: |
| ...nity_emb/infinity_emb/transformer/utils_optimum.py | 42.85% | 4 Missing :warning: |
| ...ibs/infinity_emb/infinity_emb/transformer/utils.py | 50.00% | 2 Missing :warning: |

Additional details and impacted files

```diff
@@            Coverage Diff             @@
##             main     #118      +/-   ##
==========================================
- Coverage   74.25%   72.67%   -1.59%
==========================================
  Files          31       32       +1
  Lines        1678     1749      +71
==========================================
+ Hits         1246     1271      +25
- Misses        432      478      +46
```
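
The project coverage figures in the diff above follow from the hit and line counts it reports. The snippet below is a minimal sketch of that arithmetic, not Codecov's implementation: `coverage_pct` is a hypothetical helper, and how Codecov rounds or truncates the displayed percentages is not stated in the report.

```python
def coverage_pct(hits: int, lines: int) -> float:
    """Return covered lines as a percentage of all tracked lines."""
    if lines <= 0:
        raise ValueError("lines must be positive")
    return 100.0 * hits / lines


# Counts taken from the coverage diff: (hits, misses, lines).
base_hits, base_misses, base_lines = 1246, 432, 1678  # main
head_hits, head_misses, head_lines = 1271, 478, 1749  # PR #118

# Sanity check: every tracked line is either a hit or a miss.
assert base_hits + base_misses == base_lines
assert head_hits + head_misses == head_lines

base = coverage_pct(base_hits, base_lines)
head = coverage_pct(head_hits, head_lines)
delta = head - base

print(f"base={base:.4f}% head={head:.4f}% delta={delta:+.4f}%")
```

The computed values land close to the reported 74.25%, 72.67%, and -1.59%. Small differences in the last digit depend on the rounding rule applied to the displayed figures.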
