CURG-archive / graspit_handop

Other
1 stars 2 forks source link

Create monitoring system for cluster #9

Closed jon-weisz closed 11 years ago

jon-weisz commented 11 years ago

Create a monitor that will email or text if the cluster becomes unstable or behaves badly.

jon-weisz commented 11 years ago

This should do something like: For server in server_dict: Test that server is not too busy Test if graspit is hung email or text someone if either condition is true for any active server.