Allow for data processing to be done by modules external from the stimulus-py package and not necessarily in python
Example :
Splitting a sequence dataset could be done using pairwise similarity + kmeans
There are many tools doing pairwise similarity, some even gpu accelerated, and some nf-core modules could do this
Current way to do this would be to wrap the method in python and bumb the pip package but it isn't the nf-core way, it would be much better if instead, we could re-use modules with some minor overhead for format/data processing.
This should be done while considering :
whether the tool should be ran or not depends on the experiment config
code should be kept clean (there are many tools that could interface with data processing - and the number will keep growing ), chaining IF/ELSE is not an option
good error handling (i.e. blast can't run on images for instance)
Description of feature
Allow for data processing to be done by modules external from the stimulus-py package and not necessarily in python
Example :
Splitting a sequence dataset could be done using pairwise similarity + kmeans There are many tools doing pairwise similarity, some even gpu accelerated, and some nf-core modules could do this
Current way to do this would be to wrap the method in python and bumb the pip package but it isn't the nf-core way, it would be much better if instead, we could re-use modules with some minor overhead for format/data processing.
This should be done while considering :