michaelfeil / infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
https://michaelfeil.github.io/infinity/
MIT License
1.06k stars 75 forks source link

Asking to truncate to max_length but no maximum length #121

Closed semoal closed 4 months ago

semoal commented 4 months ago
2024-02-29T06:34:41.018 app[17816011be4689] ord [info] Asking to truncate to max_length but no maximum length is provided and the model has no predefined maximum length. Default to no truncation.

Should we worry about this log? We've seen that you pushed commits that truncates to longest_first on torch, but not released yet, could be related Michael?

semoal commented 4 months ago

Not related to Infinity.

We're running https://huggingface.co/naver/efficient-splade-VI-BT-large-doc model with transformers package, and indeed there's not max_length in the config.json of the model