CV in ADBench

Minqi824 / ADBench

Official Implementation of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.

BSD 2-Clause "Simplified" License

853 stars, 134 forks

I sincerely apologize for my late reply. In anomaly detection problems, the training set often contains only a few labeled samples (e.g., 5 labeled anomalies), and cross-validation (CV) reduces the number of labeled samples available in each fold even further. Two suggestions:

You can apply a data augmentation method such as oversampling or SMOTE, and then run CV on the concatenation of the training and testing sets.
You can set la (the ratio of labeled anomalies) to 1.00, so that all labeled anomalies are available in the training set. The training set can then be concatenated with the testing set to perform cross-validation, although anomalies may still be very rare on some datasets.
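As a rough illustration of the first suggestion, here is a minimal sketch that uses only the Python standard library. It is not ADBench code: the toy labels and the helper names `stratified_folds` and `oversample_anomalies` are hypothetical. Note one assumption that departs slightly from the wording above: the sketch oversamples inside each training fold rather than before splitting, so that duplicated anomalies do not end up in the validation fold. In practice one would probably reach for scikit-learn's `StratifiedKFold` and imbalanced-learn's `SMOTE` instead of these hand-written helpers.

```python
import random
from collections import defaultdict


def stratified_folds(labels, n_splits, seed=0):
    """Assign every sample index to one of n_splits folds, spreading each class evenly."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    folds = [[] for _ in range(n_splits)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal the shuffled indices of this class round-robin across the folds
        for pos, idx in enumerate(indices):
            folds[pos % n_splits].append(idx)
    return folds


def oversample_anomalies(train_idx, labels, seed=0):
    """Randomly duplicate anomaly indices until they match the number of normal samples."""
    rng = random.Random(seed)
    anomalies = [i for i in train_idx if labels[i] == 1]
    normals = [i for i in train_idx if labels[i] == 0]
    if not anomalies:
        return list(train_idx)
    n_extra = max(0, len(normals) - len(anomalies))
    extra = [rng.choice(anomalies) for _ in range(n_extra)]
    return normals + anomalies + extra


# Hypothetical toy labels (0 = normal, 1 = anomaly), training and testing sets concatenated
y_train = [0] * 40 + [1] * 3
y_test = [0] * 55 + [1] * 2
y_all = y_train + y_test

n_splits = 5  # assumption: no more folds than labeled anomalies
folds = stratified_folds(y_all, n_splits)
for k, val_idx in enumerate(folds):
    train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
    balanced_idx = oversample_anomalies(train_idx, y_all, seed=k)
    n_val_anom = sum(y_all[i] for i in val_idx)
    print(f"fold {k}: train={len(balanced_idx)}, val={len(val_idx)}, val anomalies={n_val_anom}")
```

With so few anomalies, the number of folds is bounded by the number of labeled anomalies if every validation fold is to contain at least one, which is the practical limit the reply above points at.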

CV in ADBench #8