linkedin / cruise-control

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
https://github.com/linkedin/cruise-control/tags
BSD 2-Clause "Simplified" License
2.77k stars 595 forks source link

Improve per task observability through additional logging #2204

Closed allenxwang closed 1 month ago

allenxwang commented 1 month ago

Summary

  1. Why:

We want to track and analyze task execution for each task in the log analysis tool. But currently there is not enough information logged and the log does not have task UUID so it is difficult to track on the per task base.

  1. What:

Added more INFO logging with UUID.

Also fixed a bug where null UUID is logged by moving the call to clearCompletedExecution() (where uuid is reset to null) to after the logging.

Additional evidence

Execution succeeded: task Id: null; removed brokers: [103986]; demoted brokers: null; total time used: 874448ms.. 

Categorization