DUNE / dist-comp

Action items for DUNE distributed computing, and common scripts that are used.

Get Global Pool landscape monitoring plots easily found/accessible #81

Open StevenCTimm opened 1 year ago

Andrew-McNab-UK commented 1 year ago

For the log files, presumably their service needs to know about the DUNE pool schedds and poll them to transfer the log files from finished jobs? Do we know who is responsible for that part of the system?

For the plots, at one point there were probes running for each schedd: https://github.com/HEPiX-batchmonitoring/fifemon-condor-probe

StevenCTimm commented 11 months ago

RITM1788320 is filed with the Fermilab monitoring people. They have got part of it done but not all of it yet, and they still have to get the Landscape monitoring authorization signing key on JustIN and OSG.

StevenCTimm commented 4 months ago

So at the moment the Graphite-based plots are working for the JustIN schedd but not for the Fermilab-based schedds dunegpsched01/dunegpsched02, at least when jobs are submitted to those with POMS. We still have to file a SNOW ticket on this.
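
For anyone debugging the missing plots, here is a rough sketch of what a per-schedd probe would need to emit for Graphite to plot anything. It is not the actual fifemon probe code: the function name `graphite_lines`, the metric prefix `dune.globalpool`, and the sample job ads are all made up for illustration. The only things taken as given are the standard HTCondor `JobStatus` integer codes and the Graphite plaintext format (`path value timestamp`, one metric per line).

```python
from collections import Counter

# Standard HTCondor JobStatus codes; anything else is counted as "other".
JOB_STATUS = {1: "idle", 2: "running", 3: "removed", 4: "completed", 5: "held"}


def graphite_lines(schedd, job_ads, timestamp, prefix="dune.globalpool"):
    """Count jobs by status for one schedd and format Graphite plaintext lines.

    `prefix` and the metric layout are hypothetical, not the real Landscape paths.
    """
    counts = Counter(
        JOB_STATUS.get(ad.get("JobStatus"), "other") for ad in job_ads
    )
    # Dots separate Graphite path components, so flatten any in the host name.
    node = schedd.replace(".", "_")
    return [
        f"{prefix}.{node}.jobs.{status} {count} {timestamp}"
        for status, count in sorted(counts.items())
    ]


if __name__ == "__main__":
    # Hypothetical job ads standing in for a real schedd query result.
    sample_ads = [{"JobStatus": 1}, {"JobStatus": 2}, {"JobStatus": 2}, {"JobStatus": 5}]
    for line in graphite_lines("dunegpsched01", sample_ads, 1700000000):
        print(line)
```

If no lines like these arrive for dunegpsched01/dunegpsched02, the plots stay empty for those schedds regardless of how the dashboards are configured, so that is probably the first thing to check in the ticket.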

There is an ongoing discussion about whether we can get the full job history of the DUNE Global Pool into Kibana.