Describe the bug
When the multiline parser configuration is wrong, the buffer for its chunks fills up memory and signals the emitter to stop sending to CloudWatch. As a result, log data goes missing in CloudWatch.
SEV2-Major
To Reproduce
Re-integrate celery multiline parser with all pods
Break the multiline parser
Restart FluentBit daemonset
Restart notify pods
Tweak the FluentBit input definitions to include FluentBit debug logs
FluentBit will stop sending logs to CloudWatch for the affected pods
Expected behavior
Logs should be sent to CloudWatch consistently even if there is a config error in the multiline parser. If the memory chunk fills up, the offending logs should be dropped so that the remaining logs can make it through.
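The difference between the observed and the expected behavior can be sketched with a toy model. This is not FluentBit's implementation; the function `deliver` and its parameters `limit_bytes` and `drop_on_full` are hypothetical names used only for illustration, and the model assumes the buffer is flushed between records, so each record is compared against the limit on its own.

```python
from typing import List


def deliver(records: List[str], limit_bytes: int, drop_on_full: bool) -> List[str]:
    """Toy model: return the records that would reach the output.

    drop_on_full=False models the observed behavior (the emitter pauses
    once a record overflows the buffer, so nothing after it gets through).
    drop_on_full=True models the expected behavior (only the offending
    record is dropped).
    """
    delivered = []
    paused = False
    for record in records:
        if paused:
            # Observed behavior: the emitter is paused, so later logs are lost.
            continue
        if len(record.encode("utf-8")) > limit_bytes:
            if drop_on_full:
                # Expected behavior: drop only the offending record.
                continue
            paused = True
            continue
        delivered.append(record)
    return delivered


# One oversized record (e.g. a runaway multiline chunk) between normal logs.
sample = ["ok-1", "x" * 64, "ok-2", "ok-3"]
observed = deliver(sample, limit_bytes=32, drop_on_full=False)  # ["ok-1"]
expected = deliver(sample, limit_bytes=32, drop_on_full=True)   # ["ok-1", "ok-2", "ok-3"]
```

In this sketch the pause policy loses every log after the oversized record, while the drop policy loses only that record, which is the outcome this issue asks for.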
Impact
Missing logs cause audit and compliance issues and hamper troubleshooting.
Impact on Notify team: We can fall out of policy compliance and will have more difficulty troubleshooting issues.
Additional context
Action item from FluentBit invalid timestamps incident.
Next steps
Reopen the AWS ticket on this issue and provide support with the configuration along with debug log samples.