voutcn / megahit

Ultra-fast and memory-efficient (meta-)genome assembler
http://www.ncbi.nlm.nih.gov/pubmed/25609793
GNU General Public License v3.0
588 stars 134 forks source link

How to get the reads which formed the contigs in Megahit #235

Closed RamaSravanimeduri closed 5 years ago

RamaSravanimeduri commented 5 years ago

Hi Team,

After running megahit on my data, it gave about 3 million contigs. But now, I need which reads formed these contigs to do further analysis. How to get the corresponding reads R1 and R2.

Thanking you, Rama.

sjaenick commented 5 years ago

You'll need to map them back to the contigs using a short-read aligner, e.g. bowtie2.

kislyuk commented 3 years ago

Hi, just wanted to double check - is there no way to recover read information from intermediate assembler output, instead of realigning? Aside from the added complexity, I'm worried that realigning is fundamentally less reliable than getting the read IDs from the assembler - there is no explicit guarantee that the realigned reads will map back to the location where they were used in the assembly.