cytomining / profiling-recipe

Image-based Profiling Recipe
BSD 3-Clause "New" or "Revised" License

Move from linear execution strategy to modular block design #11

Open gwaybio opened 3 years ago

gwaybio commented 3 years ago

Currently, the recipe (as defined in jump-cellpainting/profiling-recipe#14) is linear, with each step progressing sequentially. For example, the normalization step happens before the feature selection step.

A use case came up in the JUMP project in which we want to apply different normalization steps to the same input file, and then apply the same feature selection step to both normalized output files.

This process is akin to a block design, in which each pipeline step is performed if and only if a "block" for it is added to the YAML config file.
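To make the idea concrete, here is a minimal sketch of what a block-driven runner could look like. Everything in it is an assumption for illustration: the block keys (`operation`, `input`, `output`), the operation names, and the toy step functions are hypothetical and are not the recipe's actual config schema or the pycytominer API. The config is written as a Python list standing in for what a parsed YAML file might contain.

```python
from statistics import mean, pstdev
from typing import Callable, Dict, List

# Toy stand-in for a profile table: feature name -> per-sample values.
Table = Dict[str, List[float]]


def standardize(table: Table) -> Table:
    """Hypothetical normalization: z-score each feature (0.0 if constant)."""
    out = {}
    for name, vals in table.items():
        mu, sd = mean(vals), pstdev(vals)
        out[name] = [(v - mu) / sd if sd else 0.0 for v in vals]
    return out


def min_max(table: Table) -> Table:
    """Hypothetical normalization: rescale each feature to [0, 1]."""
    out = {}
    for name, vals in table.items():
        lo, hi = min(vals), max(vals)
        out[name] = [(v - lo) / (hi - lo) if hi > lo else 0.0 for v in vals]
    return out


def drop_constant(table: Table) -> Table:
    """Hypothetical feature selection: drop features with a single value."""
    return {name: vals for name, vals in table.items() if len(set(vals)) > 1}


# Registry mapping block operation names to step functions (assumed names).
OPERATIONS: Dict[str, Callable[[Table], Table]] = {
    "standardize": standardize,
    "min_max": min_max,
    "drop_constant": drop_constant,
}

# Assumed shape of the parsed config: a step runs only if a block lists it.
# Two normalization blocks read the same input; the same feature selection
# step is then applied to each normalized output.
CONFIG = [
    {"operation": "standardize", "input": "raw", "output": "standardized"},
    {"operation": "min_max", "input": "raw", "output": "min_max"},
    {"operation": "drop_constant", "input": "standardized",
     "output": "standardized_selected"},
    {"operation": "drop_constant", "input": "min_max",
     "output": "min_max_selected"},
]


def run_blocks(config: List[dict], tables: Dict[str, Table]) -> Dict[str, Table]:
    """Run each block in order, storing every output under its own name."""
    results = dict(tables)
    for block in config:
        step = OPERATIONS[block["operation"]]
        results[block["output"]] = step(results[block["input"]])
    return results


raw = {"area": [1.0, 2.0, 3.0], "constant": [5.0, 5.0, 5.0]}
results = run_blocks(CONFIG, {"raw": raw})
```

Because each block names its own input and output, adding a third normalization variant would only require two more blocks in the config, with no change to the runner.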

A couple of implications of this enhancement:

shntnu commented 3 years ago

@gwaygenomics Do you recollect what you meant by "subprocess" above? Here are the recipe files, in case it helps point to things https://github.com/jump-cellpainting/profiling-recipe/tree/master/profiles

gwaybio commented 3 years ago

Ah, good point. I was wrong about this recipe using subprocess. We use it in the pooled recipe, so this is likely how my wires got mixed.