cov-lineages / pango-designation

Repository for suggesting new lineages that should be added to the current scheme
Other
1.04k stars 97 forks source link

ambiguous lines in lineage_notes.txt #1519

Closed PeterSlickers closed 1 year ago

PeterSlickers commented 1 year ago

I use a script which reads the files lineage_notes.txt, lineages.csv and alias_key.json. The script checks for consistency before further processing the data. In my understanding, there should by exactly one line per lineage in file lineage_notes.txt. However, I have found three doublets.

grep BA.2.3.3 lineage_notes.txt
BA.2.3.3        Alias of B.1.1.529.2.3.3, Papua New Guinea and Australia lineage, from pango-designation issue #524
*BA.2.3.3       Withdrawn: Does not have a reliable defining mutation

grep BA.5.1.13 lineage_notes.txt
BA.5.1.13       Alias of B.1.1.529.5.1.13, mainly found in Brazil, from pango-designation issue #955
*BA.5.1.13      Withdrawn: Defined solely by Artic v3 artefacts, see issue #1029

grep AY.39.4 lineage_notes.txt
AY.39.4 Alias of B.1.617.2.39.4, mainly found in Denmark, from pango-designation issue #406
*AY.39.4        Withdrawn: Reassigned AY.134. Denmark lineage

I also noted some putative typos ("Withdrawn," instead of "Withdrawn:"):

grep -i Withdrawn, lineage_notes.txt
*B.1.1.235      Withdrawn, merged with B.1.1.115
*B.1.177.13     Withdrawn, now part of B.1.177.17

grep liniage lineage_notes.txt
B.1.36  Global many subliniages (see below)
B.1.436 USA liniage reassigned from previous tree 
AngieHinrichs commented 1 year ago

Thanks @PeterSlickers! I will clean those up.