LLNL / lmt

Lustre Monitoring Tools
GNU General Public License v2.0

Include non-ready MDTs in WAITING status #64

Closed behlendorf closed 7 months ago

behlendorf commented 9 months ago

As part of starting an MDT, Lustre enters the WAITING state and attempts to contact all MDTs. This can take a considerable amount of time when multiple MDTs are unavailable. The recovery_status file lists which MDTs are being waited on, so it would be helpful for ltop to print that list rather than the "0s remaining" it currently outputs. Something like this, for example:

0000     server1 data stale
0001     server2 WAITING on MDTs 0000 0002
0002     server3 data stale
0003     server4 WAITING on MDTs 0000 0002

Example recovery_status file for MDT0001:

status: WAITING
non-ready MDTs:  0000 0002
recovery_start: 1702343662
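
To make the request concrete, here is a minimal Python sketch of the transformation being asked for: read the `key: value` lines of a recovery_status file and build the text shown in the proposed ltop output. The function names `parse_recovery_status` and `format_recovery_column` are hypothetical, the field names are taken only from the example file above, and none of this is taken from lmt or from the eventual fix.

```python
def parse_recovery_status(text):
    """Parse 'key: value' lines of a recovery_status file into a dict.

    Assumption: every field sits on its own line and the key ends at the
    first colon, as in the example file in this issue.
    """
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines without a colon
            fields[key.strip()] = value.strip()
    return fields


def format_recovery_column(fields):
    """Build the recovery text for one MDT row (hypothetical helper)."""
    status = fields.get("status", "")
    if status == "WAITING":
        # List the MDTs being waited on, if the file names any.
        mdts = fields.get("non-ready MDTs", "").split()
        if mdts:
            return "WAITING on MDTs " + " ".join(mdts)
    # Otherwise fall back to the bare status string.
    return status


sample = """status: WAITING
non-ready MDTs:  0000 0002
recovery_start: 1702343662
"""

print(format_recovery_column(parse_recovery_status(sample)))
# prints: WAITING on MDTs 0000 0002
```

Falling back to the bare status when no MDTs are listed keeps the row meaningful for files that lack the `non-ready MDTs` field.
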
ofaaland commented 7 months ago

Fixed by commit 440ab6a.