ji1ai1 / 202101-PAKDD2021

PAKDD2021 第二届阿里云智能运维算法大赛
4 stars 1 forks source link

There are not enough columns in the dataset #4

Open markoxu opened 2 years ago

markoxu commented 2 years ago

Hi there,

Actually, there are only 28 columns in the memory_sample_kernel_log_round1_b_test.csv, but 30 columns are specificed, which cause pandas parese errors. Do you have any idea? Thanks.

https://github.com/ji1ai1/202101-PAKDD2021/blob/35be9ac97d09e64e52eda0f5a75dd21c783f2cef/%E8%A8%93%E7%B7%B4.py#L130-L132

memory_sample_kernel_log_round1_b_test.csv columns, there are no columns like "故障時間" or "故障類型"

collect_time,1_hwerr_f,1_hwerr_e,2_hwerr_c,2_sel,3_hwerr_n,2_hwerr_s,3_hwerr_m,1_hwerr_st,1_hw_mem_c,3_hwerr_p,2_hwerr_ce,3_hwerr_as,1_ke,2_hwerr_p,3_hwerr_kp,1_hwerr_fl,3_hwerr_r,_hwerr_cd,3_sup_mce_note,3_cmci_sub,3_cmci_det,3_hwerr_pi,3_hwerr_o,3_hwerr_mce_l,serial_number,manufacturer,vendor
ji1ai1 commented 2 years ago

Round 1结束时,官方提供了同名但是包含标签的数据,新的数据有30列。这些数据可以在这里下载: memory_sample_address_log_round1_b_test.csv.zip memory_sample_kernel_log_round1_b_test.csv.zip memory_sample_mce_log_round1_b_test.csv.zip

markoxu commented 2 years ago

Got it! I will have a try later, thanks.