hannousse / Cleaned-PHP-Webshell-dataset


Connection between php_webshell_dataset.csv and cleaned_dataset.zip #1

Open Chaiyahui1998 opened 1 year ago

Chaiyahui1998 commented 1 year ago

Dear Hannousse, I would like to kindly inquire about some information regarding two files in the "Cleaned-PHP-Webshell-dataset" project: php_webshell_dataset.csv and cleaned_dataset.zip. I am curious about their relationship and purpose, but I need more information to better understand the project.

  1. Regarding the php_webshell_dataset.csv file, I would like to know what data it contains and what its data format is.

  2. For the cleaned_dataset.zip file, I am interested in understanding its contents and whether it is related to php_webshell_dataset.csv. If there is a connection, could you please explain how they are related?

  3. How are these two files used within the project? Are they part of the project, or are they standalone datasets?

  4. Are there any accompanying documents or descriptions that can provide more details about these two files?

I greatly appreciate your time and assistance, and I hope to gain a more comprehensive understanding of these two files to better comprehend and utilize the project's data. Thank you once again for your support!

Warm regards

hannousse commented 1 year ago

Dear Reader,

I want to express my sincere gratitude for your interest in our work.

Regarding your inquiries:

  1. The php_webshell_dataset.csv file comprises a table displaying values for 100 features extracted from PHP files within the dataset, along with a label indicating whether they are classified as "normal" or "webshell." These extracted features encompass various metrics such as entropy, counts of suspicious function calls (e.g., eval, base64_decode, string replacement functions), and more.

  2. The cleaned_dataset.zip file contains the actual source files used to create the php_webshell_dataset.csv dataset.

  3. These datasets are integral components of a project on PHP webshell detection using machine learning. You can find the detailed research publications attached to this email.

  4. I believe that the information provided above, along with the attached files, will greatly assist you in comprehending the project's objectives, the dataset's composition, and its potential applications in your own work.
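The kinds of features mentioned in item 1 (entropy and counts of suspicious function calls) can be illustrated with a minimal Python sketch. This is not the script used to build php_webshell_dataset.csv: the function list, the feature names, and the regex-based call counting are assumptions chosen for illustration, and the 100 actual features are not enumerated in this thread.

```python
import math
import re
from collections import Counter

# Illustrative list only (assumption); the dataset's real feature set is not given here.
SUSPICIOUS_FUNCTIONS = ("eval", "base64_decode", "str_replace", "preg_replace")


def shannon_entropy(text: str) -> float:
    """Shannon entropy of the text, in bits per character."""
    if not text:
        return 0.0
    counts = Counter(text)
    total = len(text)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def count_calls(source: str, name: str) -> int:
    """Count textual occurrences of `name(`.

    The lookbehind skips longer identifiers (my_eval), variables ($eval)
    and method calls (->eval, ::eval). PHP function names are
    case-insensitive, hence IGNORECASE. Matches inside strings or
    comments are still counted, which a real parser would avoid.
    """
    pattern = r"(?<![A-Za-z0-9_$>:])" + re.escape(name) + r"\s*\("
    return len(re.findall(pattern, source, flags=re.IGNORECASE))


def extract_features(source: str) -> dict:
    """Build a small feature dictionary for one PHP source string."""
    features = {"entropy": shannon_entropy(source)}
    for name in SUSPICIOUS_FUNCTIONS:
        features[f"count_{name}"] = count_calls(source, name)
    return features
```

Calling `extract_features` on the text of each file in cleaned_dataset.zip would yield one row per file, which is the general shape of the table described above.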

Good luck with your work.

Sincerely,

-- Abdelhakim Hannousse, Computer Science Department, Université 8 Mai 1945 Guelma, Algeria

In reply to the message from Chaiyahui1998 sent Monday, September 11, 2023, 8:08:33 AM.

Chaiyahui1998 commented 1 year ago

Dear Hannousse,

I had the honor of reading your three articles on WebShell detection, which deeply inspired me and sparked a strong interest in this field. I would love to do some machine learning practice on the open source dataset you provide to explore this area further.

Based on my initial understanding of the dataset, 'php_webshell_dataset.csv' contains 992 entries in the normal and WebShell categories, while 'cleaned_dataset.zip' contains 992 normal files and 989 WebShell files. For better practice and analysis, I want to see how each row in 'php_webshell_dataset.csv' corresponds to a file in 'cleaned_dataset.zip'. This would be very helpful to my research. If possible, I would like detailed information on these correspondences, or instructions on how to obtain them.

Thank you again for your open source contributions. I look forward to working meaningfully on your dataset.

Sincere regards
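One possible way to recover such a correspondence, when no file-name column is available, is to recompute features for every file and match them against the CSV rows by value. The sketch below assumes this setup; it is not confirmed anywhere in this thread that the CSV lacks a file identifier, and the column names passed as `keys` are hypothetical.

```python
from collections import defaultdict


def fingerprint(features: dict, keys, ndigits: int = 6) -> tuple:
    """Round the selected feature values so that differences in float
    formatting between the CSV and the recomputed values do not block a match."""
    return tuple(round(float(features[k]), ndigits) for k in keys)


def match_rows_to_files(rows, file_features, keys):
    """Map each row index to the file names whose features equal the row's.

    rows          -- list of dicts, e.g. from csv.DictReader (values may be strings)
    file_features -- dict of file name -> dict of recomputed feature values
    keys          -- feature names present in both (hypothetical names)
    """
    index = defaultdict(list)
    for name, feats in file_features.items():
        index[fingerprint(feats, keys)].append(name)
    return {i: index.get(fingerprint(row, keys), []) for i, row in enumerate(rows)}
```

A row that maps to an empty list has no matching file under the chosen features, and a row that maps to several names is ambiguous; both cases suggest adding more features to `keys` or checking that the recomputation matches the original extraction.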

hannousse commented 1 year ago

Attached to this email is the Python script used to generate the dataset. I'm not sure it is the latest version of the source code, but it can help you understand and practice the feature extraction yourself.

Good luck.

-- Abdelhakim Hannousse, Computer Science Department, Université 8 Mai 1945 Guelma, Algeria

In reply to the message from Chaiyahui1998 sent Monday, September 11, 2023, 11:38:45 AM.

Chaiyahui1998 commented 1 year ago

Thank you for your response and the information provided. I'm looking forward to receiving the Python script for generating the dataset, which will help me gain a deeper understanding and practice feature extraction. However, I haven't received the script attachment yet.

If possible, could you please resend it or provide a download link for the script? This would be greatly beneficial for my learning and practical work. I appreciate your support and assistance very much.

Best regards,

(Email: yahuichai1998@gmail.com)