madecoste / swarming

Automatically exported from code.google.com/p/swarming
Apache License 2.0
0 stars 1 forks source link

Add support to quarantine a Bot, including an automatic health check #115

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Repro:
1. Run jobs on a specific bot and tasks always fails.

Expected:
The bot stops taking tasks and the admins are alerted.

Actual:
Bot continues to kill tasks repeatedly. It's going to use signal like bug 114.

The idea is to put the bot in Quarantine mode where it just hangs in there 
until someone fixes it.

Original issue reported on code.google.com by maruel@chromium.org on 12 Jun 2014 at 9:02

GoogleCodeExporter commented 9 years ago

Original comment by maruel@chromium.org on 12 Jun 2014 at 9:03

GoogleCodeExporter commented 9 years ago

Original comment by maruel@chromium.org on 16 Jun 2014 at 6:01

GoogleCodeExporter commented 9 years ago

Original comment by maruel@chromium.org on 17 Jun 2014 at 2:38

GoogleCodeExporter commented 9 years ago
I think the health check should be provided by start_slave.py.

Original comment by maruel@chromium.org on 17 Jun 2014 at 5:15

GoogleCodeExporter commented 9 years ago

Original comment by maruel@chromium.org on 17 Jun 2014 at 5:15

GoogleCodeExporter commented 9 years ago
Backend support added in b37d3079db45346f93098b115dd2babc5c9366c7. No API nor 
frontend yet.

Original comment by maruel@chromium.org on 23 Jul 2014 at 1:30

GoogleCodeExporter commented 9 years ago
It's important but I have too many things going on at the moment to work on it 
right now.

Original comment by maruel@chromium.org on 17 Sep 2014 at 6:54

GoogleCodeExporter commented 9 years ago
Lowering priority

Original comment by maruel@chromium.org on 30 Sep 2014 at 5:27

GoogleCodeExporter commented 9 years ago
Issue 114 has been merged into this issue.

Original comment by maruel@chromium.org on 13 Nov 2014 at 1:23

GoogleCodeExporter commented 9 years ago
This was implemented. The UI is described in issue 123.

Original comment by maruel@chromium.org on 2 Dec 2014 at 7:51