Closed zy041225 closed 5 years ago
Hi,
Both kmers looks good. The kmer-distribution is used in indexing, if your assembly is good enough, ignore it, otherwise please make sure most of k-mers lay within 2~<-K>, esp. 95% quantile not too high.
I have no more suggestion on how to tuning on corrected reads, Ihave little experience with it.
Best, Jue
Thanks for the answer.
Hi Jue,
I'm trying to assemble a genome using wtdbg2 with default parameters. I've run two kinds of assembling, one only using wtdbg2 with raw sequel reads, and one using CANU first to get corrected reads then assembled with wtdbg2. I found the k-mer distributions are different between them (the first one was generated from raw sequel reads, the second one was generated from CANU-corrected reads), and it seemed that the corrected-reads version looks better. However, my CANU-wtdbg2 assembly is 200Mb smaller than the genome size I estimated from Illumina reads while the wtdbg2-only assembly size is OK. Thus I would like to ask if the k-mer distribution of the raw reads is good to proceed and the result is reliable. Besides, do you have any suggestion about parameter tuning for the corrected version?
Here are my parameters used in wtdb2
Many thanks
Best Yang