LiDan456 / MAD-GANs

Applied generative adversarial networks (GANs) to do anomaly detection for time series data
514 stars 158 forks

why can you get labels in the "kdd99_test.npy"? #19

Open AmberLay opened 3 years ago

AmberLay commented 3 years ago

The dataset from http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html has 42 features and no labels, but yours has 35 and includes labels. Did you do some preprocessing?

luckyStarForever commented 3 years ago

Could you tell me what the environment requirements are for this project?

You could run this command in the project environment:

pip freeze > D:\requirements.txt

Please share the contents of requirements.txt. Thank you very much!

ADomec commented 2 years ago

Sorry for the late answer.

> The dataset from http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html has 42 features and no labels, but yours has 35 and includes labels. Did you do some preprocessing?

About the features: I am guessing that the 7 "symbolic" features (out of the 42) are excluded, as they may not give any useful information.

About the labels: on the same page that you linked there is a file named "corrected.gz", which is the test dataset with the label in the last column. The training data is supposed to be normal behaviour, so you could label all of it as "normal".
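
To make the guess above concrete, here is a minimal sketch of how a 35-column array (34 numeric features plus a label) could be derived from a KDD99 file such as "corrected.gz". This is not the repository's actual preprocessing: the function name `load_kdd99`, the binary 0/1 label encoding, and the `SYMBOLIC_COLS` indices (taken from the column order in the dataset's kddcup.names file) are all assumptions for illustration.

```python
import io

import numpy as np
import pandas as pd

# Assumed zero-based positions of the 7 "symbolic" columns:
# protocol_type, service, flag, land, logged_in,
# is_host_login, is_guest_login.
SYMBOLIC_COLS = [1, 2, 3, 6, 11, 20, 21]


def load_kdd99(source, symbolic_cols=SYMBOLIC_COLS):
    """Hypothetical helper: return an array of numeric features + label.

    `source` is a path (pandas infers gzip from a ".gz" suffix) or a
    file-like object containing headerless CSV with the label last.
    """
    df = pd.read_csv(source, header=None)

    # Labels in the raw files end with a dot, e.g. "normal." or "smurf.".
    labels = df.iloc[:, -1].astype(str).str.rstrip(".")
    # Assumed encoding: 0 = normal, 1 = any attack type.
    y = (labels != "normal").astype(int).to_numpy()

    # Drop the label column, then the symbolic columns (by column label).
    features = df.iloc[:, :-1].drop(columns=symbolic_cols)
    X = features.to_numpy(dtype=float)

    # 34 numeric features + 1 label column = 35 columns.
    return np.column_stack([X, y])
```

Calling `load_kdd99("corrected.gz")` on the downloaded file should then give an array whose last column is the label. If the training data is treated as purely normal, its label column could simply be set to 0.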