chadlaing / Panseq

Pan-genomic sequence analysis
http://lfz.corefacility.ca/panseq
GNU General Public License v3.0
43 stars, 14 forks

the result #23

Closed liuyongming0612 closed 6 years ago

liuyongming0612 commented 6 years ago

Why does the resulting panGenome.fasta contain so many sequences of "NNNN......."?

chadlaing commented 6 years ago

Hi @liuyongming0612,

The Ns in the pan-genome most likely come from Ns in the input sequences. A run of Ns does not match anything in the other genomes, so it is treated as "novel" and included in the pan-genome.

I would suggest removing them either from your input genomes or from the generated pan-genome.

I hope that helps, Chad
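For anyone wanting to apply the second suggestion, here is a minimal sketch of filtering N-rich records out of a FASTA file such as panGenome.fasta. It is not part of Panseq and uses only the Python standard library. The function names and the 0.5 cut-off are assumptions made for illustration; choose a threshold that suits your data.

```python
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def n_fraction(seq: str) -> float:
    """Fraction of positions that are N (case-insensitive); 0.0 if empty."""
    if not seq:
        return 0.0
    return seq.upper().count("N") / len(seq)


def drop_n_rich(records: Iterable[Record],
                max_n_fraction: float = 0.5) -> Iterator[Record]:
    """Keep records whose N fraction is at most max_n_fraction.

    The default of 0.5 is an arbitrary, illustrative threshold.
    """
    for header, seq in records:
        if n_fraction(seq) <= max_n_fraction:
            yield header, seq


def write_fasta(records: Iterable[Record], width: int = 60) -> str:
    """Format records as FASTA text, wrapping sequences at `width` columns."""
    out: List[str] = []
    for header, seq in records:
        out.append(">" + header)
        for i in range(0, len(seq), width):
            out.append(seq[i:i + width])
    return "".join(line + "\n" for line in out)
```

To use it, open the file, pass the handle to `read_fasta`, filter with `drop_n_rich`, and write the string returned by `write_fasta` to a new file. The same filter can be run on the input genomes before running Panseq, although that removes whole contigs rather than just the N runs inside them.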