Closed jelber2 closed 2 years ago
Sorry we didn't answer this issue sooner. No, you do not need to sort and index any BAM files, and in fact I would not recommend it: DeepConsensus assumes that the inputs are in the original order that ccs and actc output, and changing that order by sorting might cause some data to be skipped during processing. We'll take a look and see if it's possible to turn off that warning, which I'm guessing comes from pysam, which we use for parsing the BAM files.
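For anyone who wants to check whether a BAM has already been re-sorted before passing it to DeepConsensus, here is a minimal sketch using only the Python standard library. It reads the `SO:` tag from the `@HD` line of the BAM header. The function name `bam_sort_order` is hypothetical and not part of DeepConsensus or pysam. The check is only a hint: it assumes the tool that wrote the file recorded a sort order in the header, which is not guaranteed (`samtools sort` writes `SO:coordinate` by default, but other tools may leave the tag out).

```python
import gzip
import struct


def bam_sort_order(path):
    """Return the SO value from the BAM header's @HD line, or None if absent.

    Hypothetical helper for illustration; not part of DeepConsensus or pysam.
    """
    # BAM files are BGZF-compressed, i.e. a series of gzip members,
    # so the standard gzip module can decompress them.
    with gzip.open(path, "rb") as handle:
        if handle.read(4) != b"BAM\x01":
            raise ValueError(f"{path} does not look like a BAM file")
        # l_text: length of the plain-text SAM header that follows.
        (text_length,) = struct.unpack("<i", handle.read(4))
        header_text = handle.read(text_length).decode("utf-8", errors="replace")

    for line in header_text.splitlines():
        if line.startswith("@HD"):
            for field in line.rstrip("\x00").split("\t")[1:]:
                if field.startswith("SO:"):
                    return field[3:]
    return None
```

Calling `bam_sort_order("subreads_to_ccs.bam")` and getting `"coordinate"` back would suggest the file has been through `samtools sort` and is no longer in the order that actc wrote it. Any other value, or `None`, does not prove the order is untouched.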
Ok thank you! I have not noticed anything with data being skipped, but maybe I was not looking carefully enough.
Following the tutorial (https://github.com/google/deepconsensus/blob/f1413ee0802dd09fb5a4507983314935e32ab482/docs/quick_start.md?plain=1#L95), deepconsensus-0.2.0 complains that it cannot find the index for subreads_to_ccs.bam. Running samtools sort and then samtools index makes the warning go away, but is it necessary?