blab / cartography

Dimensionality reduction distills complex evolutionary relationships in seasonal influenza and SARS-CoV-2
https://doi.org/10.1101/2024.02.07.579374
MIT License

Evaluate within-clade t-SNE cluster accuracy for SARS-CoV-2 #123

Closed by huddlej 1 month ago

huddlej commented 1 month ago

Description

Adds rules and scripts to the early SC2 workflow that sample ~2000 sequences from Nextstrain clade 21J (Delta), find clusters in a t-SNE embedding of those sequences, and compare the clusters to collapsed Pango lineages. Also updates the manuscript with a new paragraph at the end of the main SARS-CoV-2 section, a supplemental figure of counts per lineage/cluster, and a new row in the supplemental accuracy table reporting the accuracy of the clusters from this analysis.
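For reviewers who want a concrete picture of the comparison step, here is a minimal sketch of one way to score cluster labels against lineage labels and to tabulate counts per lineage/cluster. It is an illustration under assumptions, not the code added in this PR. The pairwise definition of accuracy, the function names `pairwise_accuracy` and `counts_per_lineage_and_cluster`, and the toy labels are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def pairwise_accuracy(cluster_labels, lineage_labels):
    """Return the fraction of sequence pairs on which the two labelings agree.

    A pair "agrees" when both labelings place the two sequences together or
    both place them apart. This pairwise definition is an assumption made for
    illustration; the PR does not spell out its metric.
    """
    if len(cluster_labels) != len(lineage_labels):
        raise ValueError("label lists must have the same length")

    agree = 0
    total = 0
    for i, j in combinations(range(len(cluster_labels)), 2):
        same_cluster = cluster_labels[i] == cluster_labels[j]
        same_lineage = lineage_labels[i] == lineage_labels[j]
        agree += int(same_cluster == same_lineage)
        total += 1

    # With fewer than two sequences there are no pairs to score.
    return agree / total if total else float("nan")


def counts_per_lineage_and_cluster(cluster_labels, lineage_labels):
    """Count sequences for each (lineage, cluster) combination."""
    return Counter(zip(lineage_labels, cluster_labels))


# Hypothetical toy labels for five sequences (not data from this analysis).
clusters = [0, 0, 1, 1, 1]
lineages = ["AY.4", "AY.4", "AY.25", "AY.25", "AY.4"]

accuracy = pairwise_accuracy(clusters, lineages)
counts = counts_per_lineage_and_cluster(clusters, lineages)
```

With these toy labels, 6 of the 10 pairs agree, so `accuracy` is 0.6, and `counts` shows one AY.4 sequence assigned to cluster 1. The pair loop is quadratic in the number of sequences, which is acceptable for a sample of roughly 2000.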

Related issues

Closes #110