blab / nextflu

Real-time tracking of influenza evolution
http://nextflu.org
GNU Affero General Public License v3.0
88 stars 19 forks source link

GISAID flu B lineages aren't fully parsed #104

Closed trvrb closed 9 years ago

trvrb commented 9 years ago

I just noticed about 1/3 if all human flu B sequences in GISAID are not assigned Vic or Yam. I noticed this because new B sequences from the global migration paper weren't getting included.

I think to correct for this we could use all B sequences in a pipeline (say for Vic), but then clean to keep only sequences similar enough to modern Vic HA.

rneher commented 9 years ago

Yes, this would probably be a good idea. basically in the script that adds new sequences put in an extra alignment step a recent B sequence from each lineage.

trvrb commented 9 years ago

Fixed in #106.