Here, I use a coverage threshold of 10 in the first bam, and a minmum GC of 0.4. Obviously adjust these thresholds to your blobplot. Additional bams would be supported by adding something like (.covs.bam1 > 23) and within the select() function. The resulting goodcontigs.txt is a simple text list of contig names compatible with blobtools seqfilter.
Hello,
This is mostly a PSA, as the following took me way to long to work out myself. Perhaps the authors could add this to the docs somewhere appropriate.
To filter a set of contigs based on the GC content and coverage (a la the blobplot), one can use the following
jq
command:Here, I use a coverage threshold of 10 in the first bam, and a minmum GC of 0.4. Obviously adjust these thresholds to your blobplot. Additional bams would be supported by adding something like
(.covs.bam1 > 23) and
within theselect()
function. The resultinggoodcontigs.txt
is a simple text list of contig names compatible withblobtools seqfilter
.Thanks for a great tool, K