CGCL-codes / VulCNN


note data.txt file #3

Open rjiles opened 2 years ago

rjiles commented 2 years ago

Hello,

I am stuck at step 3. It seems that no data.txt file has been provided, and none was generated by the previous steps. How do I get data.txt?

VulCNN commented 2 years ago

You can download some open-source C/C++ projects, such as the Linux kernel, and normalize them to generate data.txt yourself. More details can be found at https://github.com/epfml/sent2vec#train-a-new-sent2vec-model.
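
As a rough illustration of the suggestion above, the sketch below collects already-normalized C/C++ sources into a single text file with one "sentence" per line, which is the input shape that sent2vec training expects. It is not taken from the VulCNN repository: the function name `build_corpus`, the file extensions, and the choice to treat each non-empty source line as one sentence are all assumptions made for illustration.

```python
import os


def build_corpus(src_dir, out_path, exts=(".c", ".cpp", ".h")):
    """Write every non-empty source line under src_dir to out_path.

    Assumption: one source line is treated as one training sentence.
    The actual granularity used by the VulCNN authors is not stated
    in this thread.
    """
    count = 0
    with open(out_path, "w", encoding="utf-8") as out:
        for root, _dirs, files in os.walk(src_dir):
            for name in sorted(files):
                # Skip anything that is not a C/C++ source or header file.
                if not name.endswith(exts):
                    continue
                path = os.path.join(root, name)
                with open(path, encoding="utf-8", errors="ignore") as src:
                    for line in src:
                        # Collapse runs of whitespace so tokens are
                        # separated by exactly one space.
                        sentence = " ".join(line.split())
                        if sentence:
                            out.write(sentence + "\n")
                            count += 1
    return count
```

A call such as `build_corpus("./normalized_sources", "data.txt")` (the directory name is hypothetical) would produce a data.txt that could then be used as the training input described in the sent2vec README linked above.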

rjiles commented 2 years ago

I am still confused. Should the normalization step "python ./normalization.py -i ./data/sard" have produced a data.txt file?