ewels / clusterflow

A pipelining tool to automate and standardise bioinformatics analyses on cluster environments.
https://ewels.github.io/clusterflow/
GNU General Public License v3.0
97 stars 27 forks source link

Have genome details available in modules #75

Closed ewels closed 8 years ago

ewels commented 8 years ago

Have information about the reference genome (name, build, key etc) available in a variable for modules to use.

s-andrews commented 8 years ago

The idea for this is that for modules which use the genome it would be useful to be able to write the genome alias into the output file. For this we wouldn't need the actual files uses (fasta, indices etc), but rather the short name listed in the conf file and which the user would have specified when they launched clusterflow with the --genome parameter.

ewels commented 8 years ago

Looking at the code I think that because of a bonus "and everything else" parsing line I added, this should already be available to modules. Untested, but something like this:

my $genome = $cf{'config'}{'genome'};

The same goes for anything else in a runfile that has a line that begins with an @, apart from @notification and @reference which are parsed separately:

@split_files    2
@total_cores    16
@total_mem      128G
@email          ...
@notification   suspend
@genome         GRCh37
@reference      fasta    /path/to/ref
@force_paired_end

This should do the job?

ewels commented 8 years ago

Closing this issue as already implemented. Feel free to reopen if I've missed something.