yeban closed this issue 4 years ago
1, The BIN size cannot be changed.
2, --dp-max-var restricts the maximum deviation off the diagonal within a gap.
3, -s is easier to control.
4, -l is in bases. -m is also in bases: it sums the lengths of all matched k-mers, but counts their overlaps only once.
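To illustrate what "sums the k-mer lengths but removes their overlaps" in point 4 could look like, here is a minimal Python sketch. It is not wtdbg2's actual code; the function name `matched_bases`, the input layout (a list of k-mer start positions on one read), and the example numbers are all assumptions made for illustration.

```python
def matched_bases(kmer_starts, k):
    """Sum the lengths of matched k-mers, counting overlapping bases once.

    kmer_starts: start positions of matched k-mers on one read (hypothetical input).
    k: k-mer length in bases.
    """
    total = 0
    covered_end = 0  # exclusive end of the region already counted
    for start in sorted(kmer_starts):
        end = start + k
        if end <= covered_end:
            continue  # fully inside a region that was already counted
        # Count only the bases that extend past the covered region.
        total += end - max(start, covered_end)
        covered_end = end
    return total


# Hypothetical example: three 15-mers starting at positions 0, 10 and 40.
# A naive sum would give 3 * 15 = 45; the first two overlap by 5 bases.
print(matched_bases([0, 10, 40], 15))  # prints 40
```

Under this reading, a threshold on matched bases is stricter than a plain k-mer count times k whenever the matched k-mers are densely packed.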
Jue
Thanks!
wtdbg2 produced a good assembly on my PacBio dataset (50x) using default parameters (-g 450g -x seqel). I was just wondering if I can get a bit more out of the dataset by adjusting a few parameters. But I don't understand what some of the parameters mean:
1, [...] --aln-kmer-sampling option?
2, [...] (--dp-max-var) mean? Is it similar to what is conventionally the penalty for gap extension?
3, [...] (-s) or increasing the gap penalty (--dp-penalty-gap)? Should I also consider increasing the gap variation penalty (--dp-penalty-var)?
4, [...] (-l) defined by the number of base pairs, k-mers or K-bins? Accordingly, how does it differ from the -m option?
Thanks in advance for your thoughts.