Closed by iskandr 8 years ago
Confirmed that Variant("chr13", 5864876, "", "G", "GRCm38").effects()
gives a single frameshift that continues for 213 amino acids, starting with "GEKKEESELKISSSPPEDSLISSSFNYNLETNSLNSDVSSESSDSSEELSPTTK..."
The reference sequence on the poster seems to match what I get from PyEnsembl for Klf6-201, starting from offset 459:
GCT_CGG_GGG_GAG_AAG_AAG_GAG_GAA_TCA_GAA_CTG_AAG_ATT_TCT_TCT_AGT_CCC_CCA
-A- -R- -G- -E- -K- -K- -E- -E- -S- -E- -L- -K- -I- -S- -S- -S- -P- -P-
With the insertion of a G after position 462 we get a sequence of:
GCT_CGG_GGG_GGA_GAA_GAA_GGA_GGA_ATC_AGA_ACT_GAA_GAT_TTC_TTC_TAG
-A- -R- -G- -G- -E- -E- -G- -G- -I- -R- -T- -E- -D- -F- -F- *
...which also matches the poster. So, how do we get a different sequence from Varcode? Opening an issue there.
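For anyone who wants to double-check the hand translation above without Varcode or PyEnsembl, here is a minimal standalone sketch. It assumes the standard genetic code and hard-codes the 54 bases quoted above; the names `translate`, `REF` and `MUT` are made up for this example and do not come from either library. The insertion index is taken relative to offset 459. Because the insertion lands in a run of G's, the result is the same whether "position 462" is read as 0-based or 1-based.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate in frame from the first base; stop after the first stop codon."""
    protein = []
    for start in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[start:start + 3]]
        protein.append(amino_acid)
        if amino_acid == "*":
            break
    return "".join(protein)


# Reference bases copied from the comment above (Klf6-201, from offset 459).
REF = (
    "GCT_CGG_GGG_GAG_AAG_AAG_GAG_GAA_TCA_"
    "GAA_CTG_AAG_ATT_TCT_TCT_AGT_CCC_CCA"
).replace("_", "")

# Insert a G after transcript position 462, i.e. after index 3 of this slice.
insert_at = 462 - 459 + 1
MUT = REF[:insert_at] + "G" + REF[insert_at:]

print(translate(REF))
print(translate(MUT))
```

This only checks the manual codon arithmetic; it says nothing about why Varcode reports a different frameshift.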
In the CIMT poster "Neo-epitopes generated by insertions, deletions, and gene fusions as target candidates for personalized tumor vaccination" they translate the mutation mm9:chr13 g.5864121_5864122insG as RG**GEEGGIRTEDF***
In John F's sequencing of B16 he found mm10:chr13 g.5864876_5864877insG, from which Topiary predicts the following epitopes: