ShoumikSaha / DRSM

This is the implementation of DRSM paper
GNU General Public License v3.0
9 stars 3 forks source link

Dataset preparation #3

Open xiangchong1 opened 3 months ago

xiangchong1 commented 3 months ago

Is there any instruction or script for downloading and preparing the datasets used in the paper? Thank you!

yinan17 commented 2 months ago

Hi Chong, I've solved this problem. You can write a python script to directly traverse the URL address in the file and decompress them.

kzbnb commented 2 months ago

Hi Chong, I've solved this problem. You can write a python script to directly traverse the URL address in the file and decompress them.

Hi Yinan, could you share the python script on processing the datasets? That will be very helpful!

ShoumikSaha commented 2 months ago

Hi All,

I have added a python script download_benign.py. You can follow/use this file directly to download the benign files. You might have to change a few lines in the code (already commented) for different sources.