marbl / Winnowmap

Long read / genome alignment software
Other
257 stars 23 forks source link

Optimising gaps in whole genome assembly alignments #34

Open mnshgl0110 opened 2 years ago

mnshgl0110 commented 2 years ago

Hi Winnowmap Team,

I am trying to use winnowmap to do whole-genome alignments. In some cases, winnowmap is resulting in large unaligned regions like below: image

However, when I align these unaligned sequence using clustal, they seem to be quite similar. image

Do you have any suggestions on how to tweak winnowmap so that these regions can be aligned as well?

My alignment command is:

winnowmap -W TAIR10_k19.txt -x asm5 -t 10 -ac --eqx -r 10000 TAIR10_Filtered.fasta.gz qry.filtered.fa

Thanks Manish

cjain7 commented 2 years ago

If the input sequences are not too big, you could try reducing k-mer size.

mnshgl0110 commented 2 years ago

I want to use this with syri for genomic differences identification. The size of the sequences can vary (from A. thaliana (130 MB) to humans to oats (10GB)), so it would be great if there is a solution that can be adopted for different genomic sizes.