mental2008 / awesome-papers

These are my personal paper reading notes, covering cloud computing, resource management, systems, machine learning, deep learning, and other interesting topics.
https://paper.lingyunyang.com/

OSDI '20 | Twine: A Unified Cluster Management System for Shared Infrastructure #45

mental2008 commented 2 years ago

Presented at OSDI '20. [ Paper | Slides | Video ]

Authors: Chunqiang Tang, Kenny Yu, Kaushik Veeraraghavan, Jonathan Kaldor, Scott Michelson, Thawan Kooburat, Aravind Anbudurai, Matthew Clark, Kabir Gogia, Long Cheng, Ben Christensen, Alex Gartrell, Maxim Khutornenko, Sachin Kulkarni, Marcin Pawlowski, Tuomas Pelkonen, Andre Rodrigues, Rounak Tibrewal, Vaishnavi Venkatesan, and Peter Zhang (Facebook Inc.)

mental2008 commented 2 years ago

Twine was originally named Tupperware, an in-house cluster management framework built and used internally at Facebook.

Three design decisions that depart from common practice:

  1. Dynamic machine partitioning, not static clusters.
  2. Customization in shared infrastructure, not private pools.
  3. Small machines, not big machines.
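
To make the first decision more concrete, here is a minimal Python sketch of dynamic machine partitioning: logical clusters borrow machines from one shared pool and hand them back when load drops, instead of owning a fixed set. This is only an illustration, not Twine's implementation; the names `SharedPool`, `Entitlement`, `grow`, and `shrink` and the machine IDs are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class SharedPool:
    """Free machines shared by every logical cluster (hypothetical model)."""
    free: set[str] = field(default_factory=set)


@dataclass
class Entitlement:
    """A logical cluster whose machine set can grow and shrink over time."""
    name: str
    machines: set[str] = field(default_factory=set)

    def grow(self, pool: SharedPool, count: int) -> int:
        """Move up to `count` free machines from the pool into this cluster."""
        taken = sorted(pool.free)[:count]
        pool.free.difference_update(taken)
        self.machines.update(taken)
        return len(taken)

    def shrink(self, pool: SharedPool, count: int) -> int:
        """Return up to `count` machines from this cluster to the pool."""
        released = sorted(self.machines)[:count]
        self.machines.difference_update(released)
        pool.free.update(released)
        return len(released)


pool = SharedPool(free={f"m{i}" for i in range(6)})
web = Entitlement("web")
batch = Entitlement("batch")

web.grow(pool, 4)    # web takes four machines at peak load
batch.grow(pool, 2)  # batch takes the remaining two
web.shrink(pool, 2)  # web load drops, two machines go back to the pool
batch.grow(pool, 2)  # batch picks up the machines web released
```

With static clusters, the two machines released by `web` would sit idle inside its boundary; here they become available to `batch` right away.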