wuyifan18 / DeepLog

PyTorch implementation of DeepLog.
MIT License

How to decide number of candidates #56

Open alishan2040 opened 2 years ago

alishan2040 commented 2 years ago

I trained the model on my custom dataset, in which each line contains three events (e1, e2, e3), i.e. a trigram, and I want to predict e3 from e1 and e2. An example is shown below: [image] After training for 50 epochs I got: [image]

At the prediction stage I set num_candidates=1 and got a very low F1-score: [image]

My dataset has 9 classes (num_classes = 9). If I increase num_candidates, a divide-by-zero error occurs. What is the best approach for choosing num_candidates for a given dataset?

Thanks
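
One possible cause of the divide-by-zero error, assuming the prediction script computes precision as TP / (TP + FP) and F1 from precision and recall: once num_candidates is large enough, no sequence may be flagged as anomalous, so TP + FP becomes 0. The sketch below guards each denominator. The function name safe_metrics is hypothetical and is not part of the repository.

```python
def safe_metrics(tp, fp, fn):
    """Return (precision, recall, f1) in percent.

    Any metric whose denominator is zero is reported as 0.0 instead of
    raising ZeroDivisionError. This is a convention chosen for this
    sketch, not something the thread prescribes.
    """
    precision = 100.0 * tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = 100.0 * tp / (tp + fn) if (tp + fn) > 0 else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom > 0 else 0.0
    return precision, recall, f1


# Nothing was flagged as anomalous (tp = fp = 0): no exception is raised.
print(safe_metrics(tp=0, fp=0, fn=5))
```

Reporting 0.0 keeps an evaluation loop running, but a run in which nothing is flagged usually means num_candidates is too large for the data.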

shoaib-intro commented 1 year ago

To address this: increasing the number of candidates will increase accuracy but make the model less sensitive to anomalies, and vice versa. The threshold here is the window size.
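
The trade-off described above can be illustrated with a minimal sketch, assuming an event is flagged as anomalous when its true label is not among the num_candidates highest-scoring classes. The helper names is_anomalous and flagged_rate and the toy scores are hypothetical and do not come from the repository.

```python
def is_anomalous(scores, label, num_candidates):
    """Flag the event if `label` is not among the top-scoring classes."""
    # Class indices sorted from highest to lowest score.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return label not in ranked[:num_candidates]


def flagged_rate(samples, num_candidates):
    """Fraction of (scores, label) pairs flagged as anomalous."""
    flagged = sum(is_anomalous(s, y, num_candidates) for s, y in samples)
    return flagged / len(samples)


# Toy data (made up): per-class scores for 4 classes and the true next event.
samples = [
    ([0.7, 0.1, 0.1, 0.1], 0),
    ([0.4, 0.3, 0.2, 0.1], 1),
    ([0.4, 0.3, 0.2, 0.1], 3),
]

# Sweep num_candidates from 1 up to the number of classes.
for k in range(1, 5):
    print(k, flagged_rate(samples, k))
```

When num_candidates equals the number of classes, every label is in the candidate set and nothing can be flagged. A common way to pick a value is to run a sweep like this on held-out normal and abnormal data and keep the value with the best F1-score.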