Closed joverlee521 closed 4 weeks ago
I was curious how well annotated the is-lab-host field is, so I looked into seasonal-cov/229e to compare it with the excluded passaged strains.
This makes sense because the is-lab-host field seems to depend on the presence of the source/lab_host field in the GenBank record. The records that do not match lack the source/lab_host field, but they were flagged as passaged because they were included in a paper.
Prompted by https://github.com/nextstrain/lassa/pull/19#discussion_r1707512603
Including the is-lab-host and lab-host fields from NCBI Datasets will help with programmatically excluding lab-passaged sequences from builds.
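
As a rough illustration of what that programmatic exclusion could look like, here is a minimal sketch in Python. It assumes the metadata is a TSV with a column literally named is-lab-host holding values like "true"; the column name, the accepted truthy values, and the helper `split_lab_passaged` are all hypothetical and not taken from any existing Nextstrain workflow.

```python
import csv
import io


def split_lab_passaged(rows, flag_column="is-lab-host"):
    """Split metadata rows into (kept, excluded) based on a lab-host flag.

    Assumption: the flag column holds a truthy string such as "true" when
    the GenBank record has a source/lab_host annotation. Rows with an empty
    or missing flag are kept, since absence of the flag is not evidence
    that a sequence was not passaged.
    """
    kept, excluded = [], []
    for row in rows:
        value = (row.get(flag_column) or "").strip().lower()
        if value in {"true", "1", "yes"}:
            excluded.append(row)
        else:
            kept.append(row)
    return kept, excluded


# Made-up example records, for illustration only.
example_tsv = (
    "accession\tis-lab-host\tlab-host\n"
    "A1\ttrue\tVero cells\n"
    "A2\t\t\n"
    "A3\tfalse\t\n"
)
rows = list(csv.DictReader(io.StringIO(example_tsv), delimiter="\t"))
kept, excluded = split_lab_passaged(rows)
```

Because some passaged records lack the source/lab_host field entirely, a filter like this would complement, not replace, a manually curated exclude list.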