The Andersen lab will continue to update consensus sequences from SRA runs. These eventually get uploaded to GenBank by the original submitters and become available through the NCBI data. However, there is a delay in the data through NCBI, so we can merge the Andersen lab data with the NCBI data to get the latest available sequences.
Both sets of data have the SRA accession, so we can use that to dedup the data.
Follow up to https://github.com/nextstrain/avian-flu/pull/28 + https://github.com/nextstrain/avian-flu/pull/40
The Andersen lab will continue to update consensus sequences from SRA runs. These eventually get uploaded to GenBank by the original submitters and become available through the NCBI data. However, there is a delay in the data through NCBI, so we can merge the Andersen lab data with the NCBI data to get the latest available sequences.
Both sets of data have the SRA accession, so we can use that to dedup the data.