d0ng1ee / logdeep

log anomaly detection toolkit including DeepLog
MIT License
387 stars 115 forks source link

Question about hdfs_train, hdfs_test_normal, and hdfs_test_abnormal #29

Open tongxiao-cs opened 2 years ago

tongxiao-cs commented 2 years ago

Thanks for your awesome work! @donglee-afar

I have two questions about hdfs_train, hdfs_test_normal, and hdfs_test_abnormal:

1) How to get them from the whole dataset? I mean, how to divide the whole dataset into train and test after we already have an event id sequence for each BlockId in the HDFS log?

2) I learn from data/hdfs/gen_train_data.py that, hdfs_train contains only normal data. I wonder if I'm right.

Looking forward to your reply! Thank you!

X-zhihao commented 1 year ago

Did you solve this problem?