Open StevenCTimm opened 1 year ago
RITM1788320 is filed with Fermilab monitoring people They have got part of it done but not all of it yet.. and still have to get the landscape monitoring authorization signing key on Justin and osg.
So at the moment the graphite-based plots are working for the JustIN schedd but not for the Fermi-based schedds dunegpsched01/dunegpsched02 at least when those are submitted to with POMS... we still have to file a SNOW ticket on this.
There is ongoing discussion to see if we can get the full job history of the dune global pool into Kibana.
For the log files, presumably their thing needs to know about the DUNE pool schedds and to poll them to transfer the log files from the finished jobs? Do we know who is responsible for that part of the system?
For the plots, at one point there were probes running for each schedd. https://github.com/HEPiX-batchmonitoring/fifemon-condor-probe