sspku-2021 / PBCNN

15 stars 8 forks

How can I get the datasets used in this project? #1

Open dycyber opened 3 years ago

dycyber commented 3 years ago

The dataset I downloaded looks like this:

Friday-WorkingHours.pcap
Monday-WorkingHours.pcap
Thursday-WorkingHours.pcap
Tuesday-WorkingHours.pcap
Wednesday-WorkingHours.pcap

However, this dataset does not match your code. I'm not clear about the dataset your project expects. For example, what kind of files are in ' / home / fgtc / documents / notebooks / data cache/raw benign_ ? Could you share a picture of the directory tree? Thanks!

YAMY1234 commented 2 years ago

Same problem here! I don't know how to process the original pcap files of IDS2017 and IDS2018, because I don't know which labels match them. How can I get the matching labels to tell benign traffic from malicious traffic?

I couldn't find the place in your code that deals with this. Could you tell me where it is, or how you solved this problem?

sspku-2021 commented 2 years ago

Read the data set description on the website carefully!

YAMY1234 commented 2 years ago

Thanks for replying! Sorry for my unfamiliarity with this dataset. I read through the IDS2017 description again but found nothing new 🙁 The relevant information about the components of the dataset is:

"The CICIDS2017 dataset consists of labeled network flows, including full packet payloads in pcap format, the corresponding profiles and the labeled flows (GeneratedLabelledFlows.zip) and CSV files for machine and deep learning purpose (MachineLearningCSV.zip) are publicly available for researchers. If you are using our dataset, you should cite our related paper which outlining the details of the dataset and its underlying principles:"

The whole dataset tree is:

| Name | Last modified | Size |
| --- | --- | --- |
| GeneratedLabelledFlows.md5 | 2019-09-10 11:05 | 61 |
| GeneratedLabelledFlows.zip | 2019-09-10 11:20 | 271M |
| MachineLearningCSV.md5 | 2019-09-10 11:06 | 57 |
| MachineLearningCSV.zip | 2019-09-10 11:17 | 224M |
| PCAPs/ | 2019-09-11 08:37 | |

To state my problem more specifically: I compared the overall packet count of "GeneratedLabelledFlows/Monday-WorkingHours.pcap_ISCX.csv" with that of "PCAPs/WorkingHours.pcap", and the numbers do not match /(ㄒoㄒ)/~~ (the pcap packet count (1170971) is 20 times greater than the CSV count (529919)).

How can I relate the specific benign or malicious label to the pcap file? Could you explain this in a bit more detail? Thanks a lot!
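
One likely reason the counts differ: the quoted description says the CSV files contain labeled flows, while a pcap stores individual packets, so one CSV row can cover many packets. Below is a minimal sketch of grouping packets into bidirectional flows, assuming the packets have already been parsed into simple records. The `Packet` type, `flow_key`, `group_into_flows`, and the sample addresses are hypothetical illustrations, not taken from the PBCNN code, and real flow exporters also split flows on timeouts and connection teardown, which this sketch ignores.

```python
from collections import defaultdict
from typing import NamedTuple


class Packet(NamedTuple):
    """Hypothetical parsed packet record (not a PBCNN or dataset type)."""
    timestamp: float
    src_ip: str
    src_port: int
    dst_ip: str
    dst_port: int
    protocol: int


def flow_key(pkt):
    """Return a direction-independent key so both directions share one flow."""
    endpoint_a = (pkt.src_ip, pkt.src_port)
    endpoint_b = (pkt.dst_ip, pkt.dst_port)
    first, second = sorted([endpoint_a, endpoint_b])
    return (first, second, pkt.protocol)


def group_into_flows(packets):
    """Group packets by flow key; no timeout-based splitting is done here."""
    flows = defaultdict(list)
    for pkt in packets:
        flows[flow_key(pkt)].append(pkt)
    return dict(flows)


# Made-up sample traffic: one TCP conversation (3 packets) and one UDP packet.
packets = [
    Packet(0.0, "192.168.10.5", 51000, "192.168.10.3", 80, 6),
    Packet(0.1, "192.168.10.3", 80, "192.168.10.5", 51000, 6),
    Packet(0.2, "192.168.10.5", 51000, "192.168.10.3", 80, 6),
    Packet(0.3, "192.168.10.9", 53000, "192.168.10.3", 53, 17),
]
flows = group_into_flows(packets)
print(len(packets), "packets ->", len(flows), "flows")
```

Once packets are grouped this way, a flow-level label can in principle be copied to every packet in the same flow, provided the flow keys and time ranges in the CSV can be matched to the pcap.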

hjvhj commented 2 years ago

Have you solved this problem? Could you tell me how you solved it?

sspku-2021 commented 2 years ago

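You can look at the IDS2018 description. The labels for the pcap files are tied to the time at which the files were generated. Read the dataset website's description of the dataset carefully.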
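
The time-based labeling described above could be sketched as follows. This is a minimal illustration, assuming the attack schedule has been transcribed by hand from the dataset website into time windows with the addresses involved. `AttackWindow`, `label_packet`, and the window values are hypothetical placeholders, not real schedule entries and not the PBCNN implementation.

```python
from datetime import datetime
from typing import NamedTuple


class AttackWindow(NamedTuple):
    """Hypothetical schedule entry transcribed from the dataset description."""
    start: datetime
    end: datetime
    attacker_ip: str
    label: str


# Placeholder values for illustration only; replace with the real schedule.
WINDOWS = [
    AttackWindow(
        start=datetime(2000, 1, 1, 10, 0),
        end=datetime(2000, 1, 1, 11, 0),
        attacker_ip="10.0.0.99",
        label="example-attack",
    ),
]


def label_packet(timestamp, src_ip, dst_ip, windows=WINDOWS):
    """Return the attack label if the packet falls in a window and involves
    the listed attacker address; otherwise treat it as benign."""
    for window in windows:
        in_window = window.start <= timestamp <= window.end
        involves_attacker = window.attacker_ip in (src_ip, dst_ip)
        if in_window and involves_attacker:
            return window.label
    return "benign"


inside = label_packet(datetime(2000, 1, 1, 10, 30), "10.0.0.99", "10.0.0.5")
outside = label_packet(datetime(2000, 1, 1, 12, 0), "10.0.0.99", "10.0.0.5")
unrelated = label_packet(datetime(2000, 1, 1, 10, 30), "10.0.0.7", "10.0.0.5")
print(inside, outside, unrelated)
```

The packet timestamps and the schedule would need to be in the same time zone before comparing them, since capture files usually store timestamps as epoch time.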

hjvhj commented 1 year ago

Why is the number of sessions I extracted for the "benign" label much larger than the number of sessions reported in the paper? How can I extract the "benign" label from the pcap? Could you tell me how? Thank you very much.