Open msbutler opened 5 months ago
cc @cockroachdb/disaster-recovery
@ajstorm You also asked for the following, but i dont quite follow what you're asking for. could you provide more context on what you were looking into and what you wanted to see?
- [ ] buffer consumption rangefeed source - amount of rangefeed buffer we're consuming at the source
- [ ] buffer consumption producer - amount of data buffered at the producer side
@ajstorm You also asked for the following, but i dont quite follow what you're asking for. could you provide more context on what you were looking into and what you wanted to see?
- [ ] buffer consumption rangefeed source - amount of rangefeed buffer we're consuming at the source - [ ] buffer consumption producer - amount of data buffered at the producer side
The latter might not be necessary anymore, because @dt seems to have pulled out the buffering on the producer side last night. What I was referring to on the source side is the buffer that backs this error message. IIUC, when that buffer fills, we're going to get a REASON_SLOW_CONSUMER
error and have to perform a catchup scan. If that's the case, it would be good to have metrics to know how close we're getting to that limit.
Adding a request for a metric that tracks how far behind we are when performing an initial backfill (as mentioned here).
I'll work on this issue soon.
After #125320, a @dt @ajstorm wished for more metrics. We should add these when we have dev time or wish we had them during a debugging session:
Source side
Destination side
Jira issue: CRDB-39500