Closed user-tq closed 1 year ago
Currently we have only left-normalizing functionality, in bcftools norm
. The consequence annotation in bcftools csq
is able right-align internally and give a consequence based on that when close to exon boundaries, but only to a limited extent, you may want to try that.
I can imagine extending bcftools norm
to support flexible left vs right alignment for cases where there is no overlapping transcipt on the opposite strand.
I checked that in this particular case bcftools csq
gives the same prediction for both left- and right- aligned indel.
After checking the HGVS recommendations I am confused though: your example is in the gene EGFR which is on the plus strand, therefore this specific case should be left-aligned, no?
Sorry for taking so long to reply, I think I need to clarify my question again,If there are any issues with my understanding, please point out. Direction of NM_005228 is + ,i get it from this.
GeneID:1956 ENSG00000146648.21 HGNC:3236 EGFR epidermal growth factor receptor NM_005228.5 NP_005219.2 ENST00000275493.7 ENSP00000275493.2 MANE Select 7 55019017 55211628 +
(I understand +
as the direction of the transcript is the same as that of ref, Because ref is also ‘+’ strand of chromosomes)
left----------------ref------------------right
5’--------------transcript----------------3‘
now, bwa(or other alignment tool)say:the del should be the left of ref. hgvs say: the del should be the 3' pos of transcript. In the end, a different representation emerged。
GGAATTAAGAGAAG
G---------GAAC
GGAA---------C
Of course, they are actually the same thing。 As a result, different VCF displays have also emerged。
# left- aligned
#CHROM POS ID REF ALT QUAL FILTER INFO
7 55242465 . GGAATTAAGA G . . .
7 55242478 . G C . . .
and
#hgvs
#CHROM POS ID REF ALT QUAL FILTER INFO
7 55242469 . TTAAGAGAAG C . . .
You are right, my mistake. Variants on + strand transcripts should be right-aligned while - strand transcript variants should be left-aligned.
I just added support for this via a new option -g, --gff-annot
. Please try it out and let me know if you spot anything odd.
Recently, I encountered a complex variant with an IGV image of Simply put, this mutation is
I called it using mutet2, and ultimately it was represented as
This seems to follow the left alignment rule
But according to the 3'rule of hgvs
(I confirmed the direction of my focus on transcript NM_005228) this mutation is
I also annotated the final results using VEP. Obviously, VEP does not merge p.hgvs variants.
I am looking for tools to adjust the location of del occurrence based on the direction of the transcript of interest to meet the final HGVS requirements. It seems that bcftool cannot achieve this either, Can corresponding functions be added? Thank you for any suggestions