mental2008 / awesome-papers

Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
https://paper.lingyunyang.com/

ASPLOS '16 | Baymax: QoS Awareness and Increased Utilization for Non-Preemptive Accelerators in Warehouse Scale Computers #30

Closed mental2008 closed 1 year ago

mental2008 commented 3 years ago

Presented at ASPLOS '16. [Paper]

Authors: Quan Chen, Hailong Yang, Jason Mars, Lingjia Tang (University of Michigan, Shanghai Jiao Tong University, Beihang University)

mental2008 commented 3 years ago

Existing problem

Because application workloads follow a diurnal pattern, it is natural to co-locate applications on accelerators (e.g., GPUs, ASICs, and FPGAs) to improve utilization and efficiency. However, existing work does not address the interference that arises when applications are co-located on accelerators, a problem that is fundamentally different from contention on multi-core CPUs.

Contributions

  1. Identify the two main factors that cause the long tail latency of online applications.
    • Queuing delay for compute resources.
    • PCI-e bandwidth contention during data transfer.
  2. Present a runtime system named Baymax.
    • Orchestrate the execution of computing tasks from different applications.
    • Mitigate PCI-e bandwidth contention.
    • Increase utilization.
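
To make the queuing-delay factor concrete, here is a minimal Python sketch. It is not Baymax's actual algorithm: the `Task` class, the function names, the assumption that task durations are known in advance, and the headroom-based reordering rule are all hypothetical, chosen only to illustrate why a non-preemptive accelerator hurts a latency-critical query and how reordering can help without leaving the device idle.

```python
from dataclasses import dataclass


@dataclass
class Task:
    name: str
    duration_ms: float            # assumed to be known (or predicted) up front
    latency_critical: bool = False


def completion_times(order):
    """Run tasks back to back in the given order, with no preemption.

    Returns a dict mapping each task name to its completion time in ms.
    """
    clock = 0.0
    done = {}
    for task in order:
        clock += task.duration_ms
        done[task.name] = clock
    return done


def qos_aware_order(queue, qos_target_ms):
    """Hypothetical reordering rule (an assumption, not taken from the paper).

    A batch task may run ahead of the latency-critical tasks only if it
    still fits into the remaining QoS headroom; otherwise it is deferred.
    """
    critical = [t for t in queue if t.latency_critical]
    batch = [t for t in queue if not t.latency_critical]
    # Headroom: time that can be spent on batch work before critical work must start.
    headroom = qos_target_ms - sum(t.duration_ms for t in critical)
    ahead, deferred = [], []
    for task in batch:
        if task.duration_ms <= headroom:
            ahead.append(task)
            headroom -= task.duration_ms
        else:
            deferred.append(task)
    return ahead + critical + deferred


queue = [
    Task("batch-a", 40.0),
    Task("batch-b", 15.0),
    Task("query", 10.0, latency_critical=True),
]

# Plain FIFO: the query waits behind both batch tasks.
print(completion_times(queue)["query"])        # 65.0

# Reordered with a 30 ms target: only batch-b fits in the headroom.
reordered = qos_aware_order(queue, qos_target_ms=30.0)
print([t.name for t in reordered])             # ['batch-b', 'query', 'batch-a']
print(completion_times(reordered)["query"])    # 25.0
```

In this toy setup the total busy time of the accelerator is the same 65 ms in both orders, so the query meets its target without sacrificing utilization. PCI-e bandwidth contention is not modeled here.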

My complaints

Though the research problem is interesting, some concepts are still hard to follow.

  1. Warehouse-scale computers (WSCs) -> I prefer to say large-scale clusters.
  2. Intelligent personal assistant (IPA) workloads -> I prefer to say machine learning / deep learning workloads.
  3. Non-preemptive accelerators -> The authors only use GPUs as the accelerator platform, so why not simply say GPUs instead of this more complex term?

The design of Baymax is as follows: [image]

Comparison between Baymax and prior work: [image]

"We also would like to acknowledge Xayden, born 5 weeks before the deadline, for being an awesome and easy baby."

Glad to see this ;-) Just curious whose baby this is?