Open bsweger opened 2 weeks ago
nextclade_metadata.json
is stored in a versioned S3 bucket. To see the dates and versionIds:
aws s3api list-object-versions --bucket nextstrain-data --prefix files/ncov/open/metadata_version.json --no-sign-request
To view a specific versionId
of the file:
https://nextstrain-data.s3.amazonaws.com/files/ncov/open/metadata_version.json?versionId=[versionid]
Background
On the date of model input data processing, we want to save some metadata about the current nextclade SARS-COV-2 dataset (i.e., the most current dataset listed when running
nextclade dataset list --name sars-cov-2
:We'll need this information when it's time to score the models (~90 days after submission) to ensure that we use the correct reference tree for the clade assignments. This is also important information for reproducibility (e.g., the nextclade version used to generate the data).
Definition of done
Add a function to
virus-clade-utils
:nextclade_metadata.json
file (see related incantations below)