stripe / veneur

A distributed, fault-tolerant pipeline for observability data
MIT License

Fixing behavior where we bail on all batches if one fails #1019

Open andresgalindo-stripe opened 1 year ago

andresgalindo-stripe commented 1 year ago

Summary

No longer error out of the entire loop when one batch fails.

Motivation

We want to attempt to write as many metrics as possible. Errors may be caused by things like writing duplicate time series, which shouldn't stop the entire loop.
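The diff itself is not shown in this thread, so the following is only a minimal sketch of the behavior described above, not the actual change. It assumes Go 1.20+ (for `errors.Join`). `flushBatches`, `writeBatch`, and `errDuplicate` are hypothetical names made up for illustration and are not part of veneur's API.

```go
package main

import (
	"errors"
	"fmt"
)

// writeBatch is a hypothetical stand-in for whatever call writes one
// batch of metrics to the backend.
type writeBatch func(batch []string) error

// errDuplicate is a hypothetical error modeling a "duplicate time series" failure.
var errDuplicate = errors.New("duplicate time series")

// flushBatches attempts every batch. A failing batch is recorded and
// skipped rather than aborting the loop. It returns the number of
// batches written and all collected errors joined together (nil if none).
func flushBatches(batches [][]string, write writeBatch) (int, error) {
	var errs []error
	written := 0
	for i, batch := range batches {
		if err := write(batch); err != nil {
			// Record the failure and continue with the remaining batches.
			errs = append(errs, fmt.Errorf("batch %d: %w", i, err))
			continue
		}
		written++
	}
	return written, errors.Join(errs...)
}

func main() {
	batches := [][]string{{"a"}, {"dup"}, {"b"}}
	write := func(batch []string) error {
		if batch[0] == "dup" {
			return errDuplicate
		}
		return nil
	}
	written, err := flushBatches(batches, write)
	fmt.Println(written, err)
}
```

With this shape the caller still sees every failure (and can check for a specific cause with `errors.Is`), but the batches after the failing one are still attempted.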

Test plan

Rollout/monitoring/revert plan

andrewa-stripe commented 1 year ago

https://github.com/stripe/veneur/pull/1022 seems to do the same?