Closed by jbloom 1 year ago
Oops, sorry about that and thanks for pointing it out! Should be fixed in the 2022-11-07 build, look for it around 7pm Pacific time this evening.
Awesome, thanks so much!
Thanks so much, @AngieHinrichs. The counts now seem more sensible in the 2022-11-07 tree: very few 21M counts and many more 21L:
nextstrain_clade,count
19A,12026
19B,6377
20A,112252
20B,101283
20C,61246
20D,5785
20E,101100
20F,11458
20G,72943
20H,7896
20I,599147
20J,26651
21A,1886
21B,1004
21C,41500
21D,2592
21E,46
21F,33178
21G,1323
21H,5553
21I,191546
21J,2558394
21K,968268
21L,848186
21M,170
22A,75017
22B,434838
22C,154997
22D,5165
22E,9987
22F,340
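For anyone wanting to reproduce a tally like the one above, here is a minimal sketch of counting clade assignments from tab-separated metadata. It is not necessarily how the counts in this thread were produced. The column name `Nextstrain_clade`, the helper `count_clades`, and the sample rows are assumptions for illustration only; check the header of the metadata file you actually download.

```python
import csv
import io
from collections import Counter


def count_clades(tsv_text, clade_column="Nextstrain_clade"):
    """Tally how many rows carry each clade label in tab-separated metadata.

    `clade_column` is an assumed column name; adjust it to match your file.
    Rows with an empty clade field are skipped.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return Counter(row[clade_column] for row in reader if row[clade_column])


# Made-up sample rows (not real data), just to exercise the function.
sample = (
    "strain\tNextstrain_clade\n"
    "s1\t21L\n"
    "s2\t21L\n"
    "s3\t21M\n"
    "s4\t21L\n"
    "s5\t\n"
)

counts = count_clades(sample)

# Print in the same clade,count layout used in this thread.
print("nextstrain_clade,count")
for clade, n in sorted(counts.items()):
    print(f"{clade},{n}")
```

For a real metadata file, read its (decompressed) text and pass it in place of `sample`. Comparing `counts["21L"]` against `counts["21M"]` is then a quick sanity check of the kind discussed here.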
The pre-built UShER trees have many more sequences assigned Nextstrain clade 21M than 21L. I talked to @rneher, and he thinks this might be some sort of mis-categorization. @AngieHinrichs, he suggested asking you about it.
For instance, here are the clade counts I get for the 2022-09-20 tree. You can see there are many more 21M samples (829,164) than 21L samples (10,648):
It's similar with the 2022-11-04 tree: