I have some question for the number of data in dataset. In your paper, it says that there are 18,169 programs in the ReVeal dataset, while I found 2,240 + 20,494 items in the two json file https://drive.google.com/drive/folders/1KuIYgFcvWUXheDhT--cBALsfy1I4utOy. I wonder what causes the difference, and how should I use the data in the json file?
My recommendation is going straight to the NVD and SARD test cases, and not using released partial datasets from authors. This is no critique to the ReVeal authors.
Hi, thanks for the awesome project and dataset.
I have some question for the number of data in dataset. In your paper, it says that there are 18,169 programs in the ReVeal dataset, while I found 2,240 + 20,494 items in the two json file https://drive.google.com/drive/folders/1KuIYgFcvWUXheDhT--cBALsfy1I4utOy. I wonder what causes the difference, and how should I use the data in the json file?
Thanks :)