PoonLab / covizu

Rapid analysis and visualization of coronavirus genome variation
https://filogeneti.ca/CoVizu/
MIT License
45 stars 20 forks source link

Build separate tree for XBB lineages #489

Closed ArtPoon closed 7 months ago

ArtPoon commented 1 year ago

There is an enormous number of recombinants now. Some of them can be displayed in a tree (e.g., XBB) but there are also recombinants from recombinant "parent" genomes.

ArtPoon commented 11 months ago

My concern is that eventually everything is going to be a recombinant.

ArtPoon commented 11 months ago

We could try a drop-down menu to select trees: one tree would be the original relating non-recombinants, and other trees would relate recombinant lineages that descend from a common ancestor (e.g., XBB) - then we would have a set of recombinant lineages without descendant lineages

ArtPoon commented 10 months ago

@GopiGugan I will take care of the lineage sorting and tree reconstruction on the back end, can you please handle changes to the front end (adding drop-down menu to switch between trees / non-recombinants)?

ArtPoon commented 10 months ago

Branch xbbtree will now separate XBB and XBB-derived lineages into a separate object for reconstructing a tree with filename xbbtree.[timestamp].nwk.

ArtPoon commented 10 months ago

https://github.com/PoonLab/covizu/blob/57ee3276c316554bc93e9f8f0562f2d55b18dfd1/batch.py#L152-L199

ArtPoon commented 10 months ago

I think in master branch we have been building a time tree with all lineages, including XBB and other recombinants, and then just not drawing those branches for the front-end interface. This might not have been a horrible approximation last year, but it might be causing problems now. Anyhow, above changes should address this if true.

ArtPoon commented 9 months ago

Because I am pulling out XBB and other recombinant lineages from the data prior to reconstructing the non-recombinant tree, we are not going to get residuals from fitting a molecular clock to that tree for the pulled lineages. I'm going to need to calculate an approximate divergence for each of these lineages by counting mutations and adjusting by clock estimate.

GopiGugan commented 9 months ago
image

Pending items:

ArtPoon commented 9 months ago

Let's just have the CSV button download info for all lineages, thanks

ArtPoon commented 8 months ago

Because I am pulling out XBB and other recombinant lineages from the data prior to reconstructing the non-recombinant tree, we are not going to get residuals from fitting a molecular clock to that tree for the pulled lineages. I'm going to need to calculate an approximate divergence for each of these lineages by counting mutations and adjusting by clock estimate.

Actually we should be running timetree on the XBB tree (as long as those lineages are non-recombinant relative to ancestral XBB) and extracting residuals there. The counting mutations workaround should be used for "other recombinants" only.

ArtPoon commented 8 months ago

Because I am pulling out XBB and other recombinant lineages from the data prior to reconstructing the non-recombinant tree, we are not going to get residuals from fitting a molecular clock to that tree for the pulled lineages. I'm going to need to calculate an approximate divergence for each of these lineages by counting mutations and adjusting by clock estimate.

Actually we should be running timetree on the XBB tree (as long as those lineages are non-recombinant relative to ancestral XBB) and extracting residuals there. The counting mutations workaround should be used for "other recombinants" only.

Erg, I forgot that I already did this (run timetree on the XBB lineages and extract residuals). Let's just not colour the "other recombinant" lineages by divergence for now.