frazer-lab / cluster

Repo for cluster issues.
1 stars 0 forks source link

failed drive Node 14. Ticket opened. #279

Closed tatarsky closed 5 years ago

tatarsky commented 5 years ago

Forgot to enter this for long term tracking of failures.

Drive arrived. @hiroko replacing it.

AHPC ticket: 1013089

tatarsky commented 5 years ago

Attempt at swapping drive did not work. To be 100% sure we will need to reboot the unit in case the slot is in a state it doesn't want to talk to the replacement.

There is one job on n14 and I've suspended it from further jobs. I'll email so the person can be contacted.

tatarsky commented 5 years ago

Drive replaced. RAID1 rebuilt. Unit returned to service.