Closed sebinsunny closed 3 months ago
Attention: Patch coverage is 84.21053% with 3 lines in your changes missing coverage. Please review.
Project coverage is 90.80%. Comparing base (5505b86) to head (56098ae). Report is 11 commits behind head on main.
This PR refactors the basebackup monitoring introduced in PR #615. Previously, we reset the basebackup progress file whenever a new basebackup request was made, so we missed some cases where pghoard restarts. Now the progress file is reset only when a backup succeeds, and the file also records the total bytes uploaded for the previous basebackup. On a retry caused by a pghoard restart or a failed backup request, we check whether progress has been made: if the bytes uploaded have not exceeded the value recorded in the previous state, we emit a stalled metric. This PR also adds logging of upload progress for each file and for the snapshot stages of a basebackup operation.
[SRE-7476]
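The stall check described above can be illustrated with a minimal sketch. This is not the code in this PR: the names `ProgressState`, `load_state`, `save_state`, `record_attempt`, the metric name `basebackup_stalled`, and the JSON layout of the progress file are all hypothetical, and keeping the recorded value unchanged on a stalled retry is an assumption.

```python
import json
import os
import tempfile
from dataclasses import asdict, dataclass


@dataclass
class ProgressState:
    # Hypothetical layout: bytes uploaded as recorded by the previous attempt.
    total_bytes_uploaded: float = 0.0


def load_state(path):
    """Read the progress file, or return an empty state if it does not exist."""
    try:
        with open(path) as f:
            return ProgressState(**json.load(f))
    except FileNotFoundError:
        return ProgressState()


def save_state(path, state):
    """Write the progress file via a temp file and rename, so a restart never sees a partial write."""
    fd, tmp_path = tempfile.mkstemp(dir=os.path.dirname(path) or ".")
    with os.fdopen(fd, "w") as f:
        json.dump(asdict(state), f)
    os.replace(tmp_path, path)


def record_attempt(path, bytes_uploaded, succeeded, emit_metric):
    """Update the progress file after an attempt; return True if the attempt stalled."""
    previous = load_state(path)
    if succeeded:
        # Reset only on success, so state survives restarts and failed requests.
        save_state(path, ProgressState())
        return False
    stalled = bytes_uploaded <= previous.total_bytes_uploaded
    if stalled:
        # Assumed metric name; no progress beyond the previous state.
        emit_metric("basebackup_stalled", 1)
    else:
        save_state(path, ProgressState(total_bytes_uploaded=bytes_uploaded))
    return stalled
```

In this sketch, `emit_metric` is any callable taking a metric name and a value, which keeps the example independent of a particular metrics client.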
About this change - What it does
Resolves: #xxxxx
Why this way