Closed FedeGueli closed 2 months ago
Just copying and pasting my previous comment on ORF7a:F59K below.
ORF7a:F59K is a really bizarre mutation—it's a triple T->A mutation. T->A is the third-rarest type of nuc mutation, so there's no way this happened stepwise. It's happened numerous times independently throughout the pandemic, so I have no doubt that it's real. There are now 50 sequences total with ORF7a:F59K that I think are legit.
There are other double-A or triple-A mutations but they usually fit one of a number of patterns, the main ones being TRS-like motifs and recombination with the region just before the N start codon. This one doesn't fit either pattern. However, with it, 25764-25774 makes for a perfect reverse complement of an 11-nucleotide stretch in NSP6 (11131-11141). I regard stretches of 8-9 nucleotides as likely being coincidental, but 11 nucleotides is unlikely to be a coincidence.
So my guess is that this triple-T->A mutation is due to recombination with this this region of NSP6, though I don't feel overly certain about that.
It's also worth noting that there's a second stretch of nucleotides, 7593-7602, that forms the ORF7a:F59K region forms a perfect reverse complement with over a 10-nucleotide stretch, which again is more than you'd expect to find by coincidence.
And here's another fascinating aspect of this triple T->A mutation I'd discussed elsewhere but forgot to mention in the previous issue: it zips up three unpaired nucleotides on the stem part of a stem-loop, perfectly matching three unpaired T's on the opposite side. This is far from the only multi-nuc mutation to follow this pattern, some of which involve even more extensive mutations.
Thanks a lot @ryhisner !
+1 Canada
Designated JN.1.16.4 and MT.1
thx @DailyCovidCases i was convinced it was by @aviczhl2 sorry.
from https://github.com/sars-cov-2-variants/lineage-proposals/issues/1702
Defining mutations: Jn.1.16 >> A532G, Orf1a.G400S (G1463A). > Orf7a:F59K (T27568A, T27569A, T27570A) > S:Q493E (C23039G) >> N:P279S (C29108T) > S:S31P (T21653C) Query for Orf7a:F59K: A532G,C23039G,T3565C [Old queries: C23039G, C29108T, T27570A by @aviczhl2 or A532G, G1463A,C4543T initial] Query for S:S31P: T21653C, T27570A Query for S:R346T: C29108T, T27570A,G22599C (by @aviczhl2 ) Samples: 18 (17 with S:Q493E) Countries: 10 sorted by date of collection: Taiwan (GBW), Singapore, Japan, Australia, Netherlands, Taiwan (not GBW), South Korea, US-NY, Canada, Italy, China
Tree (Updated on 08-10):
https://nextstrain.org/fetch/genome-test.gi.ucsc.edu/trash/ct/singleSubtreeAuspice_genome_test_5ada8_700150.json?f_userOrOld=uploaded%20sample&label=id:node_7234797
I strongly reccomend to read the @ryhisner analysis of Orf7a:F59K triple nuc mutation: here
IDs: EPI_ISL_19240822, EPI_ISL_19240865, EPI_ISL_19268033, EPI_ISL_19282315, EPI_ISL_19287866, EPI_ISL_19287875, EPI_ISL_19287877, EPI_ISL_19293072, EPI_ISL_19298328, EPI_ISL_19304934, EPI_ISL_19305174, EPI_ISL_19305194, EPI_ISL_19309288, EPI_ISL_19311730, EPI_ISL_19311825, EPI_ISL_19324738, EPI_ISL_19324755, EPI_ISL_19324796