centerforaisafety / cerberus-cluster

HPC cluster code and configurations for running on OCI
Universal Permissive License v1.0

Integrate weka-mon with our grafana #179

steven-safeai closed 1 year ago

ghost commented 1 year ago

Working through this on the headnode of our Prometheus stack as a test. Got to this point today; will finish in the morning:

[opc@prometheus-headnode export]$ /usr/local/bin/weka-mon/export/export -v -c /usr/local/bin/weka-mon/export.yml
INFO:Timeout set to 10.0 secs
INFO:looking for token file auth-token.json
INFO:Checking for /usr/local/bin/weka-mon/export/auth-token.json
INFO:Using authfile /usr/local/bin/weka-mon/export/auth-token.json
ERROR:Weka API error caught: Login attempt failed
ERROR:Weka API error caught: Login attempt failed
ERROR:Weka API error caught: Login attempt failed
^CCRITICAL:SIGINT received, exiting
[opc@prometheus-headnode export]$
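The log shows the exporter finding and using auth-token.json, so the failure is at login rather than at file lookup. One cheap thing to rule out before retrying is a truncated or otherwise malformed token file. A minimal sketch of that sanity check follows; the helper name `check_token_file` is hypothetical and not part of weka-mon, and the check assumes only that the file should be a non-empty JSON object. It cannot tell whether the credentials inside are still valid.

```python
import json
from pathlib import Path


def check_token_file(path):
    """Return (ok, message) describing whether the token file looks usable.

    Hypothetical helper, not part of weka-mon. It only verifies that the
    file exists and parses as a non-empty JSON object.
    """
    p = Path(path)
    if not p.is_file():
        return False, f"{p} does not exist or is not a regular file"
    try:
        data = json.loads(p.read_text())
    except (OSError, ValueError) as exc:
        # ValueError covers both JSON syntax errors and decoding errors.
        return False, f"{p} could not be read as JSON: {exc}"
    if not isinstance(data, dict) or not data:
        return False, f"{p} parsed, but is not a non-empty JSON object"
    # Report key names only, never the token values themselves.
    return True, f"{p} is a JSON object with keys: {sorted(data)}"


if __name__ == "__main__":
    # Path taken from the exporter log above.
    ok, message = check_token_file(
        "/usr/local/bin/weka-mon/export/auth-token.json"
    )
    print(("OK: " if ok else "PROBLEM: ") + message)
```

If the file passes this check and login still fails, the token itself is the more likely culprit (for example, expired or issued for a different cluster), which this sketch does not test.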

ghost commented 1 year ago

Done. PR for docs just added.