madecoste / swarming

Automatically exported from code.google.com/p/swarming
Apache License 2.0
0 stars 1 forks source link

Have better visibility about BOT_DIED tasks tries #202

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
https://chromium-swarm.appspot.com/user/tasks?state=bot_died only shows tasks 
that failed *twice* with BOT_DIED, as automatic retries will hide the internal 
failure.

From a user perspective, this is fine.

From an infrastructure monitoring perspective, this is bad. We need to see all 
the failures so we can get an earlier signal when things go sour.

Original issue reported on code.google.com by maruel@chromium.org on 22 Jan 2015 at 7:02