vectordotdev / vector

A high-performance observability data pipeline.
https://vector.dev
Mozilla Public License 2.0

Observability audit #1843

Closed binarylogic closed 4 years ago

binarylogic commented 4 years ago

We've come across a few issues where logging was missing, which left the user unable to understand what Vector was doing:

  1. https://github.com/timberio/vector/issues/1752
  2. https://github.com/timberio/vector/pull/1668#issuecomment-582165886
  3. https://github.com/timberio/vector/issues/1842

I'd like to audit our components to ensure that observability as a whole is adequate. Trying to brute-force this in a short amount of time isn't a good idea, so I'd prefer to do one component at a time, with the work spread across our team.

Tasks

Global

Sources

Transforms

Sinks

binarylogic commented 4 years ago

This is blocked by #1761 since it probably makes more sense to perform this after that is complete. It's likely we'll want to change how we're observing Vector in general and incorporate metrics into this process.

LucioFranco commented 4 years ago

I'll add another one I found today: we should ensure that each healthcheck gets a span that associates it with the sink that it's health checking.

Feb 18 15:55:19.677 ERROR vector::topology::builder: Healthcheck: Failed Reason: unknown API key - check your credentials

The line above doesn't tell me which sink actually failed, which may be confusing.
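
To illustrate what attaching that context could look like, here is a minimal sketch using only the Rust standard library. It is not Vector's actual logging code: `ComponentSpan`, its fields, and `healthcheck_failed` are hypothetical names that stand in for a real span carrying the sink's identity, and the output format is an assumption.

```rust
use std::fmt;

/// Hypothetical stand-in for a span: the fields that identify a component.
struct ComponentSpan {
    kind: &'static str,     // component kind, e.g. "sink"
    name: String,           // the name the user gave the sink (assumed field)
    component_type: String, // the sink type (assumed field)
}

impl fmt::Display for ComponentSpan {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        // Renders as: sink{name=... type=...}
        write!(
            f,
            "{}{{name={} type={}}}",
            self.kind, self.name, self.component_type
        )
    }
}

/// Builds the healthcheck failure message with the span's fields attached,
/// so the reader can tell which sink failed.
fn healthcheck_failed(span: &ComponentSpan, reason: &str) -> String {
    format!("ERROR {}: Healthcheck: Failed Reason: {}", span, reason)
}
```

With a span whose `name` is `my_sink` and whose `component_type` is `example` (both made-up values), the failure message names the sink directly instead of leaving the reader to guess.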

binarylogic commented 4 years ago

Closing in favor of #3192.