marbl / Winnowmap

Long read / genome alignment software
Other
235 stars 22 forks source link

Optimising gaps in whole genome assembly alignments #34

Open mnshgl0110 opened 1 year ago

mnshgl0110 commented 1 year ago

Hi Winnowmap Team,

I am trying to use winnowmap to do whole-genome alignments. In some cases, winnowmap is resulting in large unaligned regions like below: image

However, when I align these unaligned sequence using clustal, they seem to be quite similar. image

Do you have any suggestions on how to tweak winnowmap so that these regions can be aligned as well?

My alignment command is:

winnowmap -W TAIR10_k19.txt -x asm5 -t 10 -ac --eqx -r 10000 TAIR10_Filtered.fasta.gz qry.filtered.fa

Thanks Manish

cjain7 commented 1 year ago

If the input sequences are not too big, you could try reducing k-mer size.

mnshgl0110 commented 1 year ago

I want to use this with syri for genomic differences identification. The size of the sequences can vary (from A. thaliana (130 MB) to humans to oats (10GB)), so it would be great if there is a solution that can be adopted for different genomic sizes.