Netflix / Fenzo

Extensible Scheduler for Mesos Frameworks

[Question] Recommendation for long running service-style task #108

Open ghost opened 7 years ago

ghost commented 7 years ago

For a framework based on Fenzo, what are the guidelines for scheduling service-style tasks? My use case is scheduling a mix of service and batch jobs. The queueable task input to Fenzo makes no distinction between service and batch jobs, which implies that the framework itself must restart a service job when it finishes or fails. One way is to push the failed or finished service job back onto the pending queue and wait for Fenzo to schedule it. However, this may interrupt the service until the jobs already pending in the queue have been scheduled.

Is there a recommended way to handle restarts of service-style tasks while minimizing service interruption?
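
To make the requeue concern concrete, here is a minimal, library-neutral sketch in Python. It does not use Fenzo's API: `PendingQueue`, `PendingTask`, `on_task_terminated`, and the two tier constants are all hypothetical names invented for illustration. It assumes the framework keeps its own pending queue and can order it by a tier number, so that a requeued service task is considered ahead of batch tasks that were submitted earlier.

```python
import heapq
import itertools
from dataclasses import dataclass, field

# Hypothetical tiers: a lower number is handed to the scheduler first.
SERVICE_TIER = 0
BATCH_TIER = 1


@dataclass(order=True)
class PendingTask:
    tier: int                        # primary sort key
    seq: int                         # submission order breaks ties within a tier
    name: str = field(compare=False)


class PendingQueue:
    """Framework-side pending queue ordered by (tier, submission order)."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()

    def submit(self, name, tier):
        heapq.heappush(self._heap, PendingTask(tier, next(self._counter), name))

    def next_task(self):
        return heapq.heappop(self._heap).name

    def __len__(self):
        return len(self._heap)


def on_task_terminated(queue, name, is_service):
    """Requeue a finished or failed service task; batch tasks are not restarted."""
    if is_service:
        queue.submit(name, SERVICE_TIER)


queue = PendingQueue()
queue.submit("batch-1", BATCH_TIER)
queue.submit("batch-2", BATCH_TIER)

# The service task terminates after the batch jobs were already queued.
on_task_terminated(queue, "web-frontend", is_service=True)
on_task_terminated(queue, "batch-0", is_service=False)

order = [queue.next_task() for _ in range(len(queue))]
print(order)  # ['web-frontend', 'batch-1', 'batch-2']
```

With a plain FIFO queue the restarted service task would sit behind both batch jobs; ordering by tier moves it to the front. This only changes the order in which tasks are offered for placement, so it does not by itself guarantee that resources are free when the service task is considered.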

spodila commented 7 years ago

Here are some thoughts:

I spoke about capacity guarantees recently at QCon San Francisco. The slides are available here. The video should be available later from QCon.