michaelfeil / infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
https://michaelfeil.eu/infinity/
MIT License

update deferred moving to cpu & type hints improvement #187

Closed: michaelfeil closed this 3 months ago

michaelfeil commented 3 months ago

This adds the fix for #155 to the crossencoder and also fixes some type hints.

codecov-commenter commented 3 months ago

Codecov Report

Attention: Patch coverage is 70.00000%, with 6 lines in your changes missing coverage. Please review.

Project coverage is 73.52%. Comparing base (7b10965) to head (049a86a). Report is 7 commits behind head on main.

| Files | Patch % | Lines |
|---|---|---|
| ...inity_emb/infinity_emb/transformer/acceleration.py | 44.44% | 5 Missing :warning: |
| ...emb/infinity_emb/transformer/crossencoder/torch.py | 83.33% | 1 Missing :warning: |


Additional details and impacted files

```diff
@@            Coverage Diff             @@
##             main     #187      +/-   ##
==========================================
- Coverage   73.59%   73.52%   -0.07%
==========================================
  Files          32       32
  Lines        1791     1809      +18
==========================================
+ Hits         1318     1330      +12
- Misses        473      479       +6
```
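The percentages in the coverage diff follow directly from its hit and line counts. As a rough check, the sketch below recomputes them from the numbers reported for this PR; the helper name `coverage_pct` is made up for illustration and is not part of Codecov or infinity.

```python
def coverage_pct(hits: int, lines: int) -> float:
    """Return coverage as a percentage, rounded to two decimals."""
    return round(100 * hits / lines, 2)


# Counts copied from the coverage diff for base (main) and head (#187).
base_hits, base_lines = 1318, 1791
head_hits, head_lines = 1330, 1809

base = coverage_pct(base_hits, base_lines)
head = coverage_pct(head_hits, head_lines)
delta = round(head - base, 2)

# Misses are simply the tracked lines that were never hit.
base_misses = base_lines - base_hits
head_misses = head_lines - head_hits

print(f"base={base}% head={head}% delta={delta:+}%")
print(f"misses: {base_misses} -> {head_misses}")
```

The project figure drops slightly even though 12 of the 18 added lines are hit, because the ratio for the added lines (about 67%) is below the existing project coverage.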
