apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0

[Enhancement] Improve Compaction Profiling and Logging #41753

Open Yukang-Lian opened 4 days ago

Yukang-Lian commented 4 days ago

Description

We propose several improvements to the compaction process, focusing on better profiling, logging, and error reporting. Below are the specific tasks:

  1. Add compaction profiling:

    • Profile the I/O and CPU time during compaction.
    • Report the volume of data read from and written to the cache and S3 during compaction.
  2. Enhance compaction status with cumulative error codes:

    • Add cumulative status error codes to track errors that occur during compaction more effectively.
  3. Add detailed parameters to compaction status:

    • Provide detailed information about recent compaction scheduling, including specific data about the last few compactions.
  4. Add logging at the compaction scheduling phase:

    • Improve logging at the compaction scheduling stage to make it easier to troubleshoot issues.
  5. Track progress for large compactions:

    • Add progress tracking for large-scale compactions to monitor their status.
  6. Optimize compaction logs:

    • Reduce unnecessary warning logs in both cloud and community environments to avoid log clutter.
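
To make items 1, 2, 3 and 5 more concrete, here is a minimal sketch of the kind of data such a profile and status could carry. It is written in Python purely for illustration (the Doris BE is C++), and every name in it (`CompactionProfile`, `CompactionStatus`, `merge_chunks`, the `"E_IO"` code) is hypothetical and not an existing Doris API. The "I/O" and "merge" steps are in-memory stand-ins, so the timings only show where the counters would be updated.

```python
import time
from collections import Counter, deque
from dataclasses import dataclass, field


@dataclass
class CompactionProfile:
    """Hypothetical per-compaction counters (not Doris's actual structures)."""
    tablet_id: int
    total_rows: int            # rows expected to be merged, used for progress
    rows_done: int = 0
    io_seconds: float = 0.0    # wall-clock time spent in read calls
    cpu_seconds: float = 0.0   # process CPU time spent merging
    bytes_read: int = 0        # e.g. read from cache or S3
    bytes_written: int = 0     # e.g. written to cache or S3

    def progress(self) -> float:
        """Fraction of rows merged so far, clamped to [0, 1]."""
        if self.total_rows == 0:
            return 1.0
        return min(1.0, self.rows_done / self.total_rows)


@dataclass
class CompactionStatus:
    """Cumulative error counts plus a bounded history of recent compactions."""
    history_size: int = 5
    error_counts: Counter = field(default_factory=Counter)
    recent: deque = field(init=False)

    def __post_init__(self):
        # Keep only the last `history_size` runs.
        self.recent = deque(maxlen=self.history_size)

    def record(self, profile, error_code=None):
        """Store one finished run; count its error code if it failed."""
        if error_code is not None:
            self.error_counts[error_code] += 1
        self.recent.append((profile, error_code))


def merge_chunks(profile, chunks):
    """Simulate a merge over in-memory byte chunks, updating the profile."""
    for chunk in chunks:
        t0 = time.perf_counter()
        data = bytes(chunk)              # stand-in for a read from cache/S3
        profile.io_seconds += time.perf_counter() - t0
        profile.bytes_read += len(data)

        c0 = time.process_time()
        merged = bytes(sorted(data))     # stand-in for the actual merge work
        profile.cpu_seconds += time.process_time() - c0

        profile.bytes_written += len(merged)
        profile.rows_done += len(merged)  # one byte plays the role of one row
```

A caller would create one `CompactionProfile` per compaction, poll `progress()` while a large compaction runs, and pass the finished profile to `CompactionStatus.record(...)` so the status endpoint can show both the last few runs and the cumulative error counts.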

Solution

No response

Yoruet commented 4 days ago

I'd like to try this. Please assign it to me.