logpai / loglizer

A machine learning toolkit for log-based anomaly detection [ISSRE'16]
MIT License
1.29k stars 425 forks source link

More info needed #12

Closed sharanbr closed 6 years ago

sharanbr commented 6 years ago

Can you update readme to indicate which Python file is used for what purpose. There are many Python files and it is not clear what is the purpose of each of these

ShilinHe commented 6 years ago

Thanks for your suggestion! We will update the Readme or add a Document soon or late.

sharanbr commented 6 years ago

Hello,

Can you please let me know what hdfs and bgl stand for? Thanks a lot.

On Tue, Aug 28, 2018 at 12:40 PM Shilin HE notifications@github.com wrote:

Thanks for your suggestion! We will update the Readme or add a Document soon or late.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/logpai/loglizer/issues/12#issuecomment-416475570, or mute the thread https://github.com/notifications/unsubscribe-auth/AT7C3RXbz_mEYJSBIxmBlQcBqEyIrV4Uks5uVOzqgaJpZM4WN9Pk .

sharanbr commented 6 years ago

Dear Shilin,

I need one more input. I am trying to run classifier_bgl file. Python shows error with respect to the data file. IOError: File ../../Data/BGL_data/BGL_MERGED.log does not exist

Do I need to download any data file? Is BGL_MERGED.log an input or an output file?

Can you give some directions?

On Sat, Sep 1, 2018 at 11:50 AM Sharan Basappa sharan.basappa@gmail.com wrote:

Hello,

Can you please let me know what hdfs and bgl stand for? Thanks a lot.

On Tue, Aug 28, 2018 at 12:40 PM Shilin HE notifications@github.com wrote:

Thanks for your suggestion! We will update the Readme or add a Document soon or late.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/logpai/loglizer/issues/12#issuecomment-416475570, or mute the thread https://github.com/notifications/unsubscribe-auth/AT7C3RXbz_mEYJSBIxmBlQcBqEyIrV4Uks5uVOzqgaJpZM4WN9Pk .

ShilinHe commented 6 years ago

The hdfs and bgl are two log datasets that we used in our paper, and you can find them in the loghub project (https://github.com/logpai/loghub). You may request the datasets as introduced in the Readme

sharanbr commented 6 years ago

Hi Shilin,

I have downloaded the dataset at loghub. Do I need to rename any files to run your code? I see that classifier_bgl looks for BGL_MERGED.log but I don't see such a file in the downloaded dataset. Do I need to rename BGL_2k.txt as BGL_MERGED.log and move it to BGL_DATA directory?

Also, do you know if anyone has successfully run these code before?

On Sat, Sep 1, 2018 at 4:56 PM Shilin HE notifications@github.com wrote:

The hdfs and bgl are two log datasets that we used in our paper, and you can find them in the loghub project (https://github.com/logpai/loghub). You may request the datasets as introduced in the Readme

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/logpai/loglizer/issues/12#issuecomment-417852947, or mute the thread https://github.com/notifications/unsubscribe-auth/AT7C3bFsUnyg7NwUwROEJn6HpLqS3GfJks5uWm7ogaJpZM4WN9Pk .

ShilinHe commented 6 years ago

BGL_2k is just a small sample, you should request the full datasets through the link in Readme file.

zhujiem commented 5 years ago

We have updated our demo code recently for readability. Pls try the new demos. We will also release the benchmark code soon. @sharanbr