openshift / openshift-sdn

debug.sh: add since -3days to journalctl #332

Closed nak3 closed 7 years ago

nak3 commented 7 years ago

debug.sh collects the journal logs from all of the nodes in the cluster, so if each node has xxx GB of logs, the collector host can run out of disk space. On top of that, the current script takes a long time to run, because of the time needed to collect the journal logs and scp them.

This patch adds the --since -3days option to journalctl, so that only logs from the last 3 days are collected.
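For illustration only, here is a minimal sketch of what a time-bounded journalctl call could look like. It is not the actual diff from this PR: the helper name bounded_journal_cmd and the unit example.service are hypothetical, and the helper only prints the command line rather than running it. The flags themselves (--no-pager, -u, --since) are standard journalctl options.

```shell
#!/bin/sh
# Hypothetical helper; the name and the unit below are illustrative
# and are not taken from debug.sh.
# Prints the journalctl command line for one unit, limited to a time window.
bounded_journal_cmd() {
    unit="$1"
    # Default to the 3-day window proposed in this PR.
    window="${2:--3days}"
    echo "journalctl --no-pager -u $unit --since $window"
}

# Show the command that would collect the last 3 days for one unit.
bounded_journal_cmd example.service
```

Passing a second argument such as -1days would narrow the window further.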

danwinship commented 7 years ago

The unboundedness was intentional. In particular, we want the logs from the most recent service startup, if possible. (But also, if the service has been restarted recently, we want to see the logs from before the restart too.)

Maybe in some contexts we don't care about that as much. If so, and if the unbounded logs caused problems in some particular customer case, we could provide a modified debug.sh for that customer to run. We don't expect people to run debug.sh other than when we have asked them to, since it does not diagnose problems itself: an engineer has to actually go through the output to look for them.
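One way such a modified script might be arranged, sketched under assumptions, is to keep the unbounded collection as the default and apply a time limit only when explicitly requested. The JOURNAL_SINCE variable and the journal_args helper are invented for this example and do not exist in debug.sh.

```shell
#!/bin/sh
# Hypothetical sketch: JOURNAL_SINCE and journal_args are invented names,
# not part of debug.sh.
# Prints the journalctl arguments for one unit. The time limit is applied
# only when JOURNAL_SINCE is set, so the default stays unbounded.
journal_args() {
    unit="$1"
    if [ -n "${JOURNAL_SINCE:-}" ]; then
        echo "--no-pager -u $unit --since $JOURNAL_SINCE"
    else
        echo "--no-pager -u $unit"
    fi
}
```

With this arrangement, a call like journalctl $(journal_args example.service) would still collect the full history, including logs from before the most recent service restart, unless the person running the script set JOURNAL_SINCE=-3days first.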