Eshe0922 / ReposVul

[ICSE 2024 Industry Challenge Track] Official implementation of "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".
https://conf.researchr.org/details/icse-2024/icse-2024-industry-challenge-track/2/ReposVul-A-Repository-Level-High-Quality-Vulnerability-Dataset
36 stars 3 forks source link

Function-Level Vulnerability Detection Support in ReposVul Dataset #2

Open Ffffffffire opened 5 months ago

Ffffffffire commented 5 months ago

Dear Authors,

Thanks for your efforts in conducting this significant research work. Your contributions are invaluable to advancing the field of vulnerability detection research.

I plan to undertake research on function-level vulnerability detection using the ReposVul dataset. However, I have encountered some challenges in trying to utilize the ReposVul dataset, specifically unclear on how to accurately extract function-level code snippets and their corresponding CVE and other relevant information from the provided data. Could you please offer some guidance or additional information on how to identify and extract individual function-level code snippets and other related information such as CVE details from the dataset. Thank you for taking the time to read my request. Looking forward to your reply!

Eshe0922 commented 5 months ago

Thank you for your interest in our work. We have displayed the data column names in Data Description.

bigpinwheel666 commented 4 months ago

Dear Authors,

Thanks for your efforts in conducting this significant research work.I would like to know what the values of -1,0,1 are for each target file in details. and which condition they correspond . looking forward to hearing from you

Eshe0922 commented 4 months ago

Thank you for your attention. 1 indicates that both LLM and static tools believe that the code snippet has vulnerabilities, while 0 indicates the opposite. When it is -1, there is a contradiction between the judgment results of LLM and static tools.