Closed lizakulaeva closed 3 weeks ago
Hi @lizakulaeva!
This is a strange issue, it indicates that the length of a sequence in the overlap stage does not match the length of the sequence fetched from the FASTQ in the inference stage. But considering your command line (dorado correct --device cuda:all output.fastq > output_corrected.fasta
), the same set of sequences is used in both cases.
Can you please check that there aren't duplicate sequence names in your output.fastq
file? How was this file generated?
Thanks!
Hi @svc-jstone I've found a reason for this error message: it was because I've used the old .fai index file and forgot to delete it before running correction for the new dorado output. Hope my rookie mistake will help someone :) I'm closing this issue now.
Hi @lizakulaeva, thank you very much for reporting this back!
Hi, I have run dorado 0.8.0 simplex basecalling with the sup model, and now I am trying to perform the read correction with dorado correct. I have mentioned these swarm of errors appeared in the log file of correction script:
...and it goes for every ID.
Is it a critical warning that will influence the corrected output? If yes, is there a way to solve it? Thank you in advance!
Run environment:
dorado correct --device cuda:all output.fastq > output_corrected.fasta