Closed halexand closed 5 years ago
For me the following worked:
rule diamond_bin_summary:
input: expand("diamond_report/bin.{sample}.tsv", sample=IDS)
output: "diamond_bin_report.tsv"
shell:
"""
echo -e 'name\tnprots\tnhits\tnfull\tgenus\tngenus\tspecies\tnspecies\tavgpid' >> {output}
for x in {input}
do
cat $x >> {output}
done
"""
Happy to turn it into a PR if you like.
Hello, thanks for this, yes please submit a PR that looks about right :)
Hello,
I am trying to run MAGpy with ~3000 MAGs. When working with this larger number of mags I have come upon this error in the rule
diamond_bin_summary
:It would appear that trying to concatenate the output from all the diamond results is a bit too much for my particular system. I wonder if breaking it into a for loop would make the pipeline more extendable.