The sample experiment directory has historically contained a bunch of malicious APKs and the configs that can be run to create records.json. We should extend the layout to also cover the subsequent phases of the experiment. Furthermore, we should introduce some benign samples into the process. The aim of the sample experiment is to fully reproduce what we did, but at a small scale, on roughly 100 samples.
What we should do:
[ ] Upload a small dataset of benign and malicious samples that can be publicly shared (instead of hosting the samples on GitHub)
[ ] Create a script that will fetch the dataset and run the full experiment
[ ] Create new configs / description / readmes
[ ] Test the experiment in Docker.
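The fetch step from the list above could look roughly like the following. This is only a sketch under assumptions: the dataset is a single archive at some URL, and we publish its SHA-256 next to it. The names `fetch_dataset` and `sha256_of` are hypothetical, and where the dataset will actually be hosted is still open.

```python
import hashlib
import urllib.request
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fetch_dataset(url: str, dest: Path, expected_sha256: str) -> Path:
    """Download `url` to `dest` unless a verified copy is already present.

    `url` and `expected_sha256` are placeholders for values we would
    publish together with the dataset (assumption, not decided yet).
    """
    if dest.exists() and sha256_of(dest) == expected_sha256:
        return dest  # reuse the cached, already verified copy
    dest.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(url, str(dest))
    actual = sha256_of(dest)
    if actual != expected_sha256:
        dest.unlink()  # do not leave a corrupted archive behind
        raise ValueError(f"checksum mismatch for {dest}: got {actual}")
    return dest
```

Verifying the checksum keeps the sample experiment reproducible even if the hosted archive changes, and the cache check avoids downloading the samples again on every run, for example inside a Docker build.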
Parts of the experiment that are currently not covered:
[ ] Data preparation (records.json -> features)
[ ] Model training (not sure how to handle cross-validation on so few samples, but maybe it's going to work out)
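On the cross-validation concern: one option for a set of about 100 samples is stratified k-fold, so that every fold keeps roughly the same benign/malicious ratio. Below is a minimal stdlib-only sketch. The function names, the fold count of 5, and the 60/40 class split in the usage example are assumptions for illustration, not what the full experiment does.

```python
import random
from collections import defaultdict


def stratified_folds(labels, n_folds=5, seed=0):
    """Split sample indices into folds with a similar class ratio in each."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    folds = [[] for _ in range(n_folds)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # deal the samples of each class round-robin across the folds
        for position, index in enumerate(indices):
            folds[position % n_folds].append(index)
    return folds


def train_test_splits(labels, n_folds=5, seed=0):
    """Yield (train_indices, test_indices), holding out each fold once."""
    folds = stratified_folds(labels, n_folds, seed)
    for held_out in range(n_folds):
        test = sorted(folds[held_out])
        train = sorted(
            index
            for fold_number, fold in enumerate(folds)
            if fold_number != held_out
            for index in fold
        )
        yield train, test


# Hypothetical sample set: 60 malicious and 40 benign APKs.
labels = ["malicious"] * 60 + ["benign"] * 40
for train, test in train_test_splits(labels):
    print(len(train), len(test))
```

If the training code uses scikit-learn, `sklearn.model_selection.StratifiedKFold` provides the same kind of split and could replace this helper.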