ewels / clusterflow

A pipelining tool to automate and standardise bioinformatics analyses on cluster environments.
https://ewels.github.io/clusterflow/
GNU General Public License v3.0
97 stars 27 forks source link

Posible to specify output dir ? #118

Closed arontommi closed 7 years ago

arontommi commented 7 years ago

Well the title says it all, Big fan of cf, been using it allot on uppmax. Is there a way specify output locations for modules ?

ewels commented 7 years ago

Hi @arontommi,

Not currently, no sorry. How exactly do you see this working - you mean having the output from each CF module go into a subdirectory? I guess that this could be possible..

Phil

arontommi commented 7 years ago

This is mainly a nonissue, i was thinking this in the sense of being able to work on raw files from a space limited backed up directory, and spitting out into nobackup.

ewels commented 7 years ago

Ah I see - so just have all output files end up in a different directory? That one is easier.. 😌 Cluster Flow can't do this for you, but it's pretty simple to do using softlinks; I do this myself all the time on UPPMAX.

For example:

cd /proj/b2017001/nobackup/private/my_proj/
ln -s /proj/b2017001/private/my_proj/*.fastq.gz .
cf --genome GRCh37 fastq_star *.fastq.gz

The output files are always created in the working directory, so in theory I think you may even just be able to run in nobackup and specify the path to the raw data wherever it is. However, I think that from memory that I may have run into issues where some output filenames are created using the complete filenames, so /abs/path/file.fastq becomes /abs/path/file.bam, meaning that it's still saved in the input file directory. Given the simplicity of the softlinking solution I'm not in a huge rush to address this though.

Hope this helps! Let me know if it works / you run into any problems.

Phil

arontommi commented 7 years ago

Great thanks! still a little wet behind the ears :)

ewels commented 7 years ago

No problem at all! Glad that you're finding Cluster Flow useful :)