Open lukesteensen opened 4 years ago
Please also see #4670 for more details.
Jotting down some thoughts that came to mind when this was brought up recently. There was a lot of discussion around having a command similar to what the Agent "flare" command does, for Vector.
For some of the investigations I've looked into, here are some things that would be helpful to capture:
vector top
, basically total counts for some of our more important internal telemetry such as bytes/events sent/received, component errors, events dropped. And per component.
One of the biggest challenges when trying to help Vector users debug issues is collecting all of the relevant data from their environment. This can involve a lot of back and forth and it's not always clear which commands to run to get Vector-specific info like file checkpoints.
To address this, we should add a diagnostics subcommand (similar to homebrew's
doctor
command) that loads a user's config and gathers relevant information. That information can then be formatted nicely and output to the terminal or a file for the user to pass along to us.Each component can independently implement its own set of checks via another
build
-style method on the config traits. For example, the file source checks could return information like the following: