mskcc / tempo

CCS research pipeline to process WES and WGS TN pairs
https://cmotempo.netlify.com/
12 stars 5 forks source link

storeDir option doesn't work for all modules #1000

Open anoronh4 opened 7 months ago

anoronh4 commented 7 months ago

When testing the storeDir option with tempo, not all processes successfully cache. For example i get the message:

WARN: [RunBQSR (s_C_5DD7V7_N001_d)] StoreDir can only be used when using 'file' outputs

This means that every time we want to restart tempo for some reason, RunBQSR will be re-run for every sample, even when using the -resume option. I think the reason this happens is because of these lines of code: https://github.com/mskcc/tempo/blob/cd3ce1364481e474f70639b8b9da51412b99a6e6/pipeline.nf#L583-L585 The variables idSample and target are already in the process context, but the others are not. This might be easily adjustable. I have not tested this yet with non-alignment processes, so i don't have a complete view of which processes cannot be cached when using storeDir

anoronh4 commented 5 months ago

Other processes in dsl2 Tempo (outside of alignment_wf) that need to be adjusted include:

(this list may be updated)