Open schraderL opened 2 years ago
Hi, I am running diamond blastx on some individual eukaryotic scaffolds on a HPC environment with slurm with the following settings:
diamond blastx
slurm
diamond blastx \ --query ${assembly} \ --db uniprot/reference_proteomes.dmnd \ --outfmt 6 qseqid staxids bitscore qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore \ --fast \ --max-target-seqs 1 \ --evalue 1e-25 \ --threads ${threads} \ > ${assembly}.diamond.blastx.out
I have run this with 36 threads and 90 GB of RAM. However, CPU & RAM efficiency are both less than 5 % according to slurm:
Nodes: 1 Cores per node: 36 CPU Utilized: 02:06:08 CPU Efficiency: 2.91% of 3-00:10:12 core-walltime Job Wall-clock time: 02:00:17 Memory Utilized: 4.20 GB Memory Efficiency: 4.66% of 90.00 GB
Is there a way to improve this run so that diamond can more effectively use the resources available?
diamond
Thanks! Lukas
The problem is probably the long input sequences which is not efficient in regular blastx mode, try setting -F 15. Additionally I'd recommend -b4 -c1.
-F 15
-b4 -c1
Hi, I am running
diamond blastx
on some individual eukaryotic scaffolds on a HPC environment withslurm
with the following settings:I have run this with 36 threads and 90 GB of RAM. However, CPU & RAM efficiency are both less than 5 % according to
slurm
:Is there a way to improve this run so that
diamond
can more effectively use the resources available?Thanks! Lukas