Regarding the number of data in ReVeal dataset

VulDetProject / ReVeal

MIT License

187 stars 63 forks source link

Regarding the number of data in ReVeal dataset #17

Open DuanXu-97 opened 2 years ago

DuanXu-97 commented 2 years ago

Hi, thanks for the awesome project and dataset.

I have some question for the number of data in dataset. In your paper, it says that there are 18,169 programs in the ReVeal dataset, while I found 2,240 + 20,494 items in the two json file https://drive.google.com/drive/folders/1KuIYgFcvWUXheDhT--cBALsfy1I4utOy. I wonder what causes the difference, and how should I use the data in the json file?

Thanks :)

NikolasBielski commented 2 years ago

My recommendation is going straight to the NVD and SARD test cases, and not using released partial datasets from authors. This is no critique to the ReVeal authors.