Open ReamonYim opened 2 months ago
The training datasets are already preprocessed and can be directly used to train LLMs in step 2.4 and 2.5. The preprocess codes only work for extracting training data from raw traffic (i.e., .pcap files). If you want to reproduce the process of extracting the training data from the raw dataset of USTC TFC 2016, please download the raw dataset using its released link.
I hope this reply can help you.
The training datasets are already preprocessed and can be directly used to train LLMs in step 2.4 and 2.5. The preprocess codes only work for extracting training data from raw traffic (i.e., .pcap files). If you want to reproduce the process of extracting the training data from the raw dataset of USTC TFC 2016, please download the raw dataset using its released link.
I hope this reply can help you.
thank you very much
Dear author,
I encountered an issue when running the command:
The error is:
I downloaded the ustc-tfc-2016 files from training datasets. Could you please confirm whether the files from this link are already preprocessed or if they are the raw files needed for running the script?
Thank you!