cBio / cbio-cluster

MSKCC cBio cluster documentation
12 stars 2 forks source link

Status of Hal Maintenance Project #384

Closed tatarsky closed 8 years ago

tatarsky commented 8 years ago

I will provide periodic updates of what is being done here just to provide some status for the curious. We will be rebooting mskcc-ln1 to clear open files in about 40 minutes and begin some firmware efforts shortly afterwards.

tatarsky commented 8 years ago

mskcc-ln1 has been rebooted to start the project.

tatarsky commented 8 years ago

Work so far proceeding as planned. Still plenty more but crossing off items of higher risk first.

tatarsky commented 8 years ago

Work continues as planned. Torque/Moab migration going well.

tatarsky commented 8 years ago

Some tests this evening of our updated GPFS modules are in progress.

tatarsky commented 8 years ago

Work continues this morning. Validation of final prep to retire former ROCKS head node in progress.

aday00 commented 8 years ago

Thanks for the updates! Thanks also @juanperin for following up via email. Read-only filesystem access to (username) whenever possible would be super. At least two of us in Fuchs lab have a paper submission deadline March 17.

tatarsky commented 8 years ago

There is no difference in requirements to provide read-only GPFS. It is dependent on the network core rebuild. When the network hardware rebuild is done we will evaluate where we are. But we remain on target for the Wednesday return to service.

tatarsky commented 8 years ago

Sorry for the silence many of my tasks today are at consoles of systems. Many firmware revisions updated and Torque/Moab operational on new server. Network re-cabling getting closer. We remain on scheduled.

tatarsky commented 8 years ago

We are approaching letting people on the system. One last validation.

tatarsky commented 8 years ago

SSH access enabled. Please read #385 and we will handle any reported item as quickly as possible.