alibaba / clusterdata

cluster data collected from production clusters in Alibaba for cluster management research
1.54k stars 402 forks source link

cluster-trace-gpu-v2020中pai_sensor_table 中cpu_usage, gpu_wrk_util, avg_gpu_wrk_mem , max_gpu_wrk_mem 疑问 #182

Closed Jackjiayou closed 1 year ago

Jackjiayou commented 1 year ago

请问pai_sensor_table 中cpu_usage, gpu_wrk_util, avg_gpu_wrk_mem , max_gpu_wrk_mem代表一个任务(以worker_name区分)所用资源还是在这个任务执行的同时还会有别的任务一起使用cpu或者gpu从而代表着这个执行任务那一时刻这台机器所被占用的资源(包含其他任务)

qzweng commented 1 year ago

data cpu_usage, gpu_wrk_util, avg_gpu_wrk_mem, max_gpu_wrk_mem in pai_sensor_table is the metric for each instance, not the machine.