mental2008 / awesome-papers

Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
https://paper.lingyunyang.com/
MIT License
45 stars 3 forks source link

SOSP '21 | Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP #19

Closed mental2008 closed 1 year ago

mental2008 commented 3 years ago

Presented in SOSP '21. [ Paper | Video | Code ]

Authors: Deepak Narayanan, Fiodar Kazhamiaka, Firas Abuzaid, Peter Kraft, Akshay Agrawal, Srikanth Kandula, Stephen Boyd, and Matei Zaharia Stanford University, Microsoft Research

mental2008 commented 3 years ago

Question

How to solve a resource allocation problem (in many computer systems, e.g., cluster scheduling, traffic engineering, load balancing)?

  1. Formulate it as a mathematical optimization problem and find an exact solution (using off-the-shelf solvers). => Not feasible for large problem sizes with tight SLAs.
  2. Existing systems choose heuristic algorithms. => Not optimal allocations.

Solution

Propose a technique named Partitioned Optimization Problems (POP).

  1. Randomly split the problem into smaller problems.
  2. Coalesces the resulting sub-allocations into a global allocation.

Tradeoff between allocation quality and runtime (POP)

image

I am curious about the detailed algorithm and why random partitioning works well.