Open jcsp opened 1 year ago
The more specific case of this for timequeries is https://github.com/redpanda-data/redpanda/issues/10854, but we probably need a solution for all fetch requests.
Unassigning myself since #12021 doesn't quite get us there.
@BenPope could you drop a note on what the gap is between the improvements from #12021 and this ticket?
This issue hasn't seen activity in 3 months. If you want to keep it open, post a comment or remove the stale
label – otherwise this will be closed in two weeks.
This is still something we need to pursue.
If a system is overloaded, such that tiered storage reads (including timequeries) are not completing promptly, then Kafka clients will tend to close their connections and issue another request.
When a tiered storage read is in flight, we do not cancel it if the client that issued the original Kafka request closes its connection: the read remains enqueued and runs to completion.
On an overloaded system, this can lead to unbounded growth in the number of reads in flight.
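To illustrate the growth described above, here is a toy simulation in Python asyncio. It is only a sketch under stated assumptions, not Redpanda code: `slow_read`, `simulate`, and every parameter are hypothetical names, and the sleeps stand in for a slow remote read and a client timeout. A client retries several times, closing its connection each time before the read finishes, and the function reports the peak number of reads in flight with and without cancellation on disconnect.

```python
import asyncio


async def slow_read(stats, duration):
    """Hypothetical stand-in for a slow tiered storage read."""
    stats["in_flight"] += 1
    stats["peak"] = max(stats["peak"], stats["in_flight"])
    try:
        await asyncio.sleep(duration)  # simulated slow remote read
    finally:
        # Runs on both completion and cancellation.
        stats["in_flight"] -= 1


async def simulate(retries, cancel_on_disconnect,
                   client_timeout=0.01, read_duration=0.5):
    """Return the peak number of concurrent reads (toy model)."""
    stats = {"in_flight": 0, "peak": 0}
    orphans = []
    for _ in range(retries):
        read = asyncio.create_task(slow_read(stats, read_duration))
        # The client gives up, closes its connection, and will retry.
        await asyncio.sleep(client_timeout)
        if cancel_on_disconnect:
            # Tie the read's lifetime to the connection.
            read.cancel()
            await asyncio.gather(read, return_exceptions=True)
        else:
            # Current behaviour as described: the read keeps running.
            orphans.append(read)
    await asyncio.gather(*orphans)
    return stats["peak"]
```

With the default timings, `asyncio.run(simulate(5, cancel_on_disconnect=False))` peaks at 5 concurrent reads (one per retry), while `asyncio.run(simulate(5, cancel_on_disconnect=True))` peaks at 1.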
There are a couple of angles to addressing this:
JIRA Link: CORE-1358