RTSYork / VLAB

The RTS Virtual Lab
https://wiki.york.ac.uk/display/RTS/The+RTS+Virtual+Lab
GNU General Public License v3.0
9 stars 2 forks source link

Improved automatic monitoring #33

Open iangray001 opened 2 years ago

iangray001 commented 2 years ago

The current state of the cluster is currently not visible for monitoring.

The containers could report into a suitable monitoring service to ensure uptime of the relay and all of the boards that are mentioned in vlab.conf.

Equally, just because a boardserver is alive doesn't mean the FPGA is. When not in use, a service should periodically connect to each board, attempt to program the FPGA, and (if the board is a Zynq) attempt to send a binary, in order to check that everything is working as expected.