cfe-lab / MiCall

Pipeline for processing FASTQ data from an Illumina MiSeq to genotype human RNA viruses like HIV and hepatitis C
https://cfe-lab.github.io/MiCall
GNU Affero General Public License v3.0
14 stars 9 forks source link

Aligning to defective start codon in `vpu` #1024

Open Donaim opened 1 year ago

Donaim commented 1 year ago

In our HXB2 reference, vpu region is the following sequence:

TQPIPIVAIV ALVVAIIIAI VVWSIVIIEY RKILRQRKID RLIDRLIERA EDSGNESEGE ISALVEMGVE MGHHAPWDVD DL*

It does not start with M, and thus is defective [1].

It probably does not matter much, since it's a 1 base SNP, at a region boundary. But it has some impact since most genomes do not have this mutation, and so they align worse.