Hi @AlcaArctica
A ~50% rate of CCS molecules failing the Q20 filter is not far from expectations for Sequel II SMRT Cells. It may be slightly on the high side of normal, which could be explained if the read lengths are somewhat longer than typical (closer to 20 kb than 15 kb), or if the number of passes is on the lower side for some other reason.
I suspect the poorer assembly isn't due to anything in the DeepConsensus run, but instead relates to either the genome being complex to assemble or the assembly needing more coverage than you have.
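To make the Q20 filter mentioned above a bit more concrete, here is a minimal Python sketch of how a read-level quality cutoff and a per-chunk fail rate could be computed from per-base Phred scores. It assumes that a read's quality is the Phred score of its mean per-base error probability; that definition and all function names here (`read_quality`, `passes_q20`, `fail_rate`) are hypothetical illustrations, not the DeepConsensus API or its exact filtering logic.

```python
import math


def phred_to_error_prob(q: float) -> float:
    """Convert a Phred quality score to an error probability (Q20 -> 0.01)."""
    return 10 ** (-q / 10)


def error_prob_to_phred(p: float) -> float:
    """Convert an error probability back to a Phred quality score."""
    return -10 * math.log10(p)


def read_quality(base_quals: list[float]) -> float:
    """Read-level quality, ASSUMED here to be the Phred score of the
    mean per-base error probability (not a plain average of Q values)."""
    if not base_quals:
        raise ValueError("read has no base qualities")
    mean_error = sum(phred_to_error_prob(q) for q in base_quals) / len(base_quals)
    return error_prob_to_phred(mean_error)


def passes_q20(base_quals: list[float], min_quality: float = 20.0) -> bool:
    """True if the read-level quality meets the cutoff (Q20 by default)."""
    return read_quality(base_quals) >= min_quality


def fail_rate(reads: list[list[float]]) -> float:
    """Fraction of reads that do not meet the quality cutoff."""
    if not reads:
        raise ValueError("no reads given")
    failed = sum(1 for quals in reads if not passes_q20(quals))
    return failed / len(reads)


if __name__ == "__main__":
    # Toy data: one uniformly good read and one read with a single poor base.
    toy_reads = [[30, 30, 30, 30], [30, 30, 30, 5]]
    print(f"fail rate: {fail_rate(toy_reads):.0%}")
```

Because error probabilities are averaged rather than Q values, a few low-confidence bases can pull an otherwise good read below Q20. That is consistent with longer molecules, which get fewer passes, failing the filter more often.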
Thank you, that is reassuring! I think this issue can be closed then. I will continue to explore my assembly workflow.
I am new to PacBio sequencing and have just tried the DeepConsensus pipeline on a fairly large dataset (12 Gbp genome). I obtained 6 subread.bam files, each of which I split into 500 chunks. My resulting assembly does not look good, but of course there are many possible reasons for that. In any case, I wanted to go back and review my use of the DeepConsensus tool. For example, I am running the following commands:
This is my result:
The end of the log file reads:
Now my question is: is there something wrong with how I applied DeepConsensus that could explain my bad assembly? Is it normal to have such a high fail rate (about 50% for all my chunks)? I have also attached the full log for further information: log.txt