leibinghe / GAAL-based-outlier-detection

GAAL-based Outlier Detection
131 stars 58 forks source link

Data Reduction in Datasets you provided #9

Open panda7777777 opened 8 months ago

panda7777777 commented 8 months ago

I noticed significant data reduction in the datasets you provided (i.e., SpamBase, WDBC, ...) compared to their original versions. This isn't addressed in your documentation. Can you explain the reasoning behind these changes?

leibinghe commented 2 months ago

I noticed significant data reduction in the datasets you provided (i.e., SpamBase, WDBC, ...) compared to their original versions. This isn't addressed in your documentation. Can you explain the reasoning behind these changes?

All datasets removed duplicate data, as described in the Section 4.1.1 “we adopt the procedure described in [46] to convert the datasets to outlier evaluation datasets.”