nextstrain / seasonal-flu

Scripts. config, and snakefiles for seasonal-flu nextstrain builds
44 stars 26 forks source link

Summarize haplotype coverage by titer references using frequencies per haplotype from all available data #173

Closed huddlej closed 1 month ago

huddlej commented 2 months ago

Description of proposed changes

Replaces the current table of derived haplotype frequencies and titer references that is based on a subsampled HA tree with a table based on all available sequences during the same time period.

With the latest version of Nextclade, we can determine derived haplotype strings per record from a Nextclade annotations file with columns for clade and mutations relative to each clade. We can then calculate haplotype frequencies from all available data instead of a subset of data used to build a tree.

Development checklist

Related issue(s)

Related to #130 Depends on https://github.com/nextstrain/nextclade/pull/1492

Checklist