elastic / ember

Elastic Malware Benchmark for Empowering Researchers

Problem training EMBER 2017 feature version 2 #48

Open wilsoncwj opened 4 years ago

wilsoncwj commented 4 years ago

I have successfully trained a model on the EMBER 2018 dataset (feature version 2) using the train_ember.py script, and managed to plot the charts/results using the Jupyter notebook.

However, when I run the python train_ember.py [/path/to/dataset] command on the EMBER 2017 feature version 2 dataset, I get the following warnings:

[LightGBM] [Warning] Contains only one class
[LightGBM] [Info] Number of positive: 0, number of negative: 900000
[LightGBM] [Info] Total Bins 95616
[LightGBM] [Info] Number of data: 900000, number of used features: 1822
[LightGBM] [Info] [binary:BoostFromScore]: pavg=0.000000 -> initscore=-34.538776
[LightGBM] [Info] Start training from score -34.538776
[LightGBM] [Info] [binary:BoostFromScore]: pavg=0.000000 -> initscore=-34.538776
[LightGBM] [Warning] Stopped training because there are no more leaves that meet the split requirements

Any idea what went wrong?
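
Since the log reports 0 positives out of 900000 rows, one way to narrow this down is to count the label values in the vectorized label file directly. The snippet below is only a sketch under assumptions that are not confirmed in this thread: that the vectorization step wrote the labels to a flat float32 file (y_train.dat is the name assumed here), and that EMBER's label convention applies (0 = benign, 1 = malicious, -1 = unlabeled). The helper names are made up for illustration.

```python
import os
from array import array
from collections import Counter


def read_float32_labels(path):
    """Read a flat file of native-endian float32 values (assumed layout)."""
    labels = array("f")
    with open(path, "rb") as fh:
        labels.fromfile(fh, os.path.getsize(path) // labels.itemsize)
    return labels


def label_counts(labels):
    """Count how many times each integer label value occurs."""
    return Counter(int(v) for v in labels)


def summarize(labels):
    """Report per-class counts and whether only one labeled class is present."""
    counts = label_counts(labels)
    benign = counts.get(0, 0)
    malicious = counts.get(1, 0)
    return {
        "benign": benign,
        "malicious": malicious,
        "unlabeled": counts.get(-1, 0),
        # True when binary training would see at most one class
        "single_class": benign == 0 or malicious == 0,
    }


if __name__ == "__main__":
    # Hypothetical location; substitute the real dataset directory.
    label_file = "/path/to/dataset/y_train.dat"
    if os.path.exists(label_file):
        print(summarize(read_float32_labels(label_file)))
```

If single_class comes back True for the 2017 data, the label file itself contains no malicious samples, which would point at the vectorization step rather than at LightGBM.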