Open tseemann opened 3 years ago
Maybe this is a bigger problem of how to handle NNNNNNN
in reference genome?
The indexing method would assign a random nucleotide for each 'N'. It requires the original reference genome to verify the true nucleotides.
So bwt_index is putting ranom base for N.
I think you need to correct this problem in your output?
Until you fix it, bcftools
will not work on your VCF files.
Also the GAP is starting too late? Souldn't it begin where the first N is? Or are reads actually aligning to the random bases??
This A is wrong? Maybe it should be
.
?And it starts the Gap too late?