Closed vinjana closed 7 years ago
The first implementation of fast-track in the LSF cluster will be via queues. There is a "fasttrack" queue (see bqueues
command on head node) and similar queue are planned for "highmem" and "gpu".
As we need different such resources and therefore different queues for different jobs (running sambamba on the FPGA node doesn't make sense), we need a way to select different sets of resources on the Roddy command line.
The solution we take is based on the existing feature that resource sets can be selectively overwritten in the XMLs.
--additionalImports
parameter, which will be used similar to --additionalImports=pbs_fasttrack
or --additionalImports=lsf_fasttrack
(needs to be implemented in Roddy).--additionalImports
parameter need to be in the configuration directories, e.g. in extra files, such as "pbs_fasttrack.xml" or "lsf_fasttrack.xml". These must contain the configurations names: <configuration name="pbs_fasttrack" ...>
used in the --additionalImports
parameter.<tool name='alignAndPairSlim' value='bwaMemSortSlim.sh' basepath='qcPipeline' overrideresourcesets="true">
<resourcesets>
<rset size="t" memory="12" cores="8" nodes="1" walltime="00:10:00" queue="fpga"/>
<rset size="s" memory="12" cores="8" nodes="1" walltime="2" queue="fpga"/>
<rset size="m" memory="17" cores="8" nodes="1" walltime="6" queue="fpga"/>
<rset size="l" memory="20" cores="8" nodes="1" walltime="60" queue="fpga"/>
<rset size="xl" memory="20" cores="8" nodes="1" walltime="160" queue="fpga"/>
</resourcesets>
</tool>
<tool name='mergeAndRemoveDuplicatesSlimSambamba' value='mergeAndMarkOrRemoveDuplicatesSlim.sh' basepath='qcPipeline' overrideresourcesets="true">
<resourcesets>
<rset size="t" memory="30" cores="6" nodes="1" walltime="00:10:00" queue="devel"/>
<rset size="xs" memory="3" cores="6" nodes="1" walltime="1" queue="fasttrack"/>
<rset size="s" memory="4" cores="6" nodes="1" walltime="15" queue="fasttrack"/>
<rset size="m" memory="5" cores="6" nodes="1" walltime="20" queue="fasttrack"/>
<rset size="l" memory="6" cores="6" nodes="1" walltime="30" queue="fasttrack"/>
<rset size="xl" memory="8" cores="6" nodes="1" walltime="80" queue="fasttrack"/>
</resourcesets>
</tool>
The resource set overriding is the already implemented behaviour. Note that
overrideresourcesets="true"
(otherwise also the I/O files are overwritten)value
and basepath
attributes (otherwise this information is not available anymore)rset size="..."
) are given. Resource set size that are not given in this cluster-specific XML will not be available (i.e. the resource sets in the workflow XML are lost).The only feature that needs to be added to Roddy is the --additionalImports
parameter that allows to add imported configurations' names to the project-configuration.
Solved with #111.
In PBS this was implemented via accountName and -A option (might be specific for our cluster though).