remove hits beyond max requested hit

quickwit-oss / quickwit

Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.

Other

6.99k stars 291 forks source link

Description

due to the 1st case of what's described in https://github.com/quickwit-oss/quickwit/issues/3650 , it can happen that we return up to start_offset + 2 * max_hits documents instead of max_hits. This happens when a split first fails, and then succeed, but its result is just concatenated instead of doing a proper top-k. The fix consist in dropping the tail hits at the same time as the hits that should be omitted due to start_offset are removed.

How was this PR tested?

tested with a modified s3 that errors on a fraction of requests. Without the patch, i often get too many results (and sometime a double error so no response), with the patch, i either get the right number of docs, or said error.

quickwit-oss / quickwit

remove hits beyond max requested hit #5180

Description

How was this PR tested?

On SSD:

On GCS: