Add modular structure so that users can specify which subset of validation comparisons they want to see run (e.g., include vectors? include interventions?) #42
We are going to add a column "validation_subset" to the simulation_coordinator.csv file to organize individual sites to different subsets. Here is how the column will look like:
The subset names are dummy names which we will discourse the read names later.
2. Change at execution
I am going to update our code so that when use run the snakemake command, they have a way to choose which subset or list of subsets to run. Currently I am leaning towards adding the subsets as a snakemake config which has a default value to all sites and can be override by user at run time as:
snakemake --config s='subset_1, subset_5' -j
Note that the subset name should be non-case-sensitive.
Design details as Dec 7th 2022:
1. Change in the data:
We are going to add a column "validation_subset" to the simulation_coordinator.csv file to organize individual sites to different subsets. Here is how the column will look like:
The subset names are dummy names which we will discourse the read names later.
2. Change at execution
I am going to update our code so that when use run the snakemake command, they have a way to choose which subset or list of subsets to run. Currently I am leaning towards adding the subsets as a snakemake config which has a default value to all sites and can be override by user at run time as:
snakemake --config s='subset_1, subset_5' -j
Note that the subset name should be non-case-sensitive.3. Change in plotting and report:
TBD