metagenome assembly from ccs reads

marbl / canu

A single molecule sequence assembler for genomes large and small.

658 stars 179 forks source link

You won't be able to assemble much below about 5-fold coverage. CCS is lower coverage than the long reads typically so you may not have enough reads from the rare species to assemble anything. However, CCS reads should mostly be long enough to span entire bacterial genes so you may just be able to look for genes in the reads directly.

You can run an assembly, specify the reads as -pacbio-corrected and maybe set correctedErrorRate=0.025 (if you believe the 1% error estimate in the data). Any reads that end up in the singletons (asm.unassembled.fasta) you'd have to then annotate directly on the reads.

marbl / canu

metagenome assembly from ccs reads #856