isovic / racon

Ultrafast consensus module for raw de novo genome assembly of long uncorrected reads. http://genome.cshlp.org/content/early/2017/01/18/gr.214270.116 Note: This was the original repository which will no longer be officially maintained. Please use the new official repository here:
https://github.com/lbcb-sci/racon
MIT License
269 stars 49 forks source link

Unequal lengths in target and overlap file for target #188

Closed jules-o closed 3 years ago

jules-o commented 3 years ago

Hi,

I'm sure there's something really obvious here that I'm missing. I ran racon (in Galaxy) to polish a sequence assembled by Flye from Oxford Nanopore data, with the raw nanopore reads as the sequences and a paf overlaps file generated by minimap2 from the raw reads. That worked well, but now I'm trying to do another round of polishing, using the same reads, same overlaps file and all the same settings, except with the output from the first round as the target sequence - and I keep getting "error: unequal lengths in target and overlap file for target contig_1!"

I've tried normalizing the fasta from the first round, also copying the sequence and pasting it straight into a new file in case white spaces or something had somehow got inserted, but it's the same error each time.

What am I doing wrong?

rvaser commented 3 years ago

Hello, you need to create a new overlap file for each iteration by mapping reads to the output of the previous iteration.

Best regards, Robert

jules-o commented 3 years ago

Oh I see - thank you. And sorry for the stupid question. I had misunderstood how the overlaps file is supposed to be created. Probably surprising that it worked at all the first time, since I'd used all-to-all mapping with only the reads, rather than mapping the reads to the target sequence. That explains why it's more accurate now! Thank you for your help, Julia