Open hyanwong opened 3 weeks ago
ORF1ab starts at 266, so this is in the "extra-genic" flanks that we're excluding unconditionally.
@szhan - this is your call, what do you think?
I've no problem including stuff outside the genes, but we need to make a decision on this pretty quickly. @szhan - what are your thoughts? If these sites are phylogenetically useful, then there's not much justification for excluding them?
Hmm, we are excluding the 5' and 3' UTRs.
As discussed earlier with @jeromekelleher, we are going to redo the run including the UTR sites, and use the resulting ARG to identify problematic sites.
Site 241 is currently excluded, but 241T is a "defining" mutation for B.1 lineages, and therefore for a large number of samples high up in the tree. If it is a borderline site for exclusion, I reckon we should probably include it. I opened this issue just to track our rationale for including / excluding that particular site.
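
For reference, here is a minimal sketch of the coordinate check behind this discussion. It assumes 1-based positions and uses only the ORF1ab start of 266 quoted in this thread; the function and constant names are hypothetical and are not taken from the project's code.

```python
# Hypothetical helper, not part of the project's pipeline.
# ORF1ab start (1-based) as quoted in this thread.
ORF1AB_START = 266


def in_five_prime_flank(position: int, first_gene_start: int = ORF1AB_START) -> bool:
    """Return True if a 1-based site lies upstream of the first gene."""
    if position < 1:
        raise ValueError("positions are assumed to be 1-based")
    return position < first_gene_start


# Site 241 sits upstream of ORF1ab, so a blanket flank filter drops it.
print(in_five_prime_flank(241))
```

Under these assumptions, keeping site 241 would need either an explicit exception to the flank filter or a rerun with the UTR sites included, as proposed above.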