Open macadminrohit opened 7 years ago
It appears chronos supports a callback URL to be notified of job failures. https://github.com/mesos/chronos/pull/518/files
I am not sure about long-running jobs, however. Perhaps you can build your own tool by talking to the mesos HTTP APIs and checking for tasks from Chronos that have been running for over a certain time?
What is the best way to monitor the Chronos jobs when they have failed or halted, or long running?