youyinnn / masc_research_knowledge_base

0 stars 0 forks source link

Paper review: Axiomatic Attribution for Deep Networks #10

Open youyinnn opened 2 years ago

youyinnn commented 2 years ago

ieee: https://ieeexplore.ieee.org/document/9671521


Abstract

The server workloads of large-scale online video games are elastic and on-demand. The workload can range from tens to thousands of server instances in short periods. In fact, a cloud-based video game ecosystem can reach a workload of millions of players every week. Given such a large scale, even a small portion of over-provisioning leads to a significant amount of resource idling and a high cost of waste. It is essential to define an effective forecasting model on the game session workloads given a time span. The effectiveness shall be measured by metrics representing Service Level Objectives (SLOs). In this work, we analyze time series forecasting models using ARIMA, Prophet, and LSTM to predict the number of virtual machines in need of cloud resource monitoring data. In addition, we define service-level metrics for measuring effectiveness based on factors of over/under provision and ratio of resource waste. We analyze models with 16 fleets with an average of 2754 game servers over a four-month-long period of time in the production environment. We observe that our LSTM model is the most accurate in forecasting the demand of virtual machines in terms of RMSE and MAE. Further analysis using metrics of SLOs, we observe that the LSTM model leads to more cases of under-provisioning than ARIMA and Prophet do. The LSTM model forecasts the demand of virtual machines with less over-provision ratio than ARIMA and Prophet do for 14 out of 16 fleets. Using the LSTM model, we further evaluate the forecasting effect across different time spans of a single fleet and across multiple fleets within the same time span.



Catch-up knowledge #### Mathematical 1. Gradient 1. [Youtube Khan Academy](https://www.youtube.com/c/khanacademy/search?query=gradient) 2. Path Integral 3. $F:R^n \to [0,1]$ This is a representation of a deep network. And $R^n$ means a set of a real n-vectors ($1 \times n$ matrices). $[0,1]$ means the real intervals $\{x|0\le x\le1\}$. http://ee263.stanford.edu/notes/basic-notation.pdf
youyinnn commented 2 years ago

Implementations:

  1. Understanding Deep Learning Models with Integrated Gradients
  2. Integrated Gradients for Deep Neural Networks
  3. code: