HubSpot / Singularity

Scheduler (HTTP API and webapp) for running Mesos tasks—long running processes, one-off tasks, and scheduled jobs. #hubspot-open-source
http://getsingularity.com/
Apache License 2.0

Count all lost tasks as inactive #2263

Closed rosalind210 closed 2 years ago

rosalind210 commented 2 years ago

In a previous PR, we changed how TASK_LOST is evaluated during a deploy so that certain lost tasks, specifically those lost because of the container resource limit bug, could retry. This had an unintended effect: other deploys with lost tasks got stuck waiting indefinitely, because the lost task was neither relaunched nor counted as failed. Now that the source of the container resource limit bug has been found, we are removing this logic and counting all lost tasks as failed.
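
To illustrate the behavior described above, here is a minimal, language-agnostic sketch in Python. It is not Singularity's actual Java code: `TaskState`, `DeployProgress`, `evaluate_deploy`, and the `retry_lost_tasks` flag are all hypothetical names made up for this example. With the flag on, a lost task counts as neither healthy nor failed, so the deploy keeps waiting; with it off, any lost task fails the deploy.

```python
from enum import Enum


class TaskState(Enum):
    # Hypothetical subset of task states, for illustration only.
    TASK_RUNNING = "TASK_RUNNING"
    TASK_FAILED = "TASK_FAILED"
    TASK_LOST = "TASK_LOST"


class DeployProgress(Enum):
    # Hypothetical deploy outcomes, for illustration only.
    WAITING = "WAITING"
    FAILED = "FAILED"
    SUCCEEDED = "SUCCEEDED"


def evaluate_deploy(task_states, retry_lost_tasks):
    """Decide deploy progress from the states of its tasks.

    retry_lost_tasks=True models the earlier logic, where a lost task is
    skipped in the hope that it will be relaunched.
    retry_lost_tasks=False models the change in this PR, where every lost
    task counts as failed.
    """
    healthy = 0
    failed = 0
    for state in task_states:
        if state is TaskState.TASK_RUNNING:
            healthy += 1
        elif state is TaskState.TASK_FAILED:
            failed += 1
        elif state is TaskState.TASK_LOST and not retry_lost_tasks:
            failed += 1
        # Otherwise the lost task is ignored: it counts as neither healthy
        # nor failed, which is what can leave the deploy waiting forever.

    if failed > 0:
        return DeployProgress.FAILED
    if healthy == len(task_states):
        return DeployProgress.SUCCEEDED
    return DeployProgress.WAITING
```

Under these assumptions, a deploy with one running and one lost task stays in `WAITING` when `retry_lost_tasks` is true and nothing relaunches the lost task, and moves to `FAILED` when it is false.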

ssalinas commented 2 years ago

🚢