nextclade_metadata.json is stored in a versioned S3 bucket. To see the dates and versionIds:
aws s3api list-object-versions --bucket nextstrain-data --prefix files/ncov/open/metadata_version.json --no-sign-request
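When scoring roughly 90 days after submission, the version of interest is the one that was current on the processing date. The sketch below picks that version out of the JSON printed by the command above. It assumes the standard `Versions` array with `Key`, `VersionId`, and `LastModified` fields; the function name `version_as_of` is hypothetical, not part of the function from #20.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def version_as_of(listing_json: str, key: str, as_of: datetime) -> Optional[str]:
    """Return the VersionId of `key` that was current at `as_of`, or None.

    `as_of` must be timezone-aware, because the timestamps in the listing are.
    """
    listing = json.loads(listing_json)
    candidates = []
    for version in listing.get("Versions", []):
        if version["Key"] != key:
            continue
        # Normalize a trailing "Z" so older Python versions can parse it.
        stamp = version["LastModified"].replace("Z", "+00:00")
        modified = datetime.fromisoformat(stamp)
        if modified <= as_of:
            candidates.append((modified, version["VersionId"]))
    if not candidates:
        return None
    # The most recently modified version at or before `as_of` was the live one.
    return max(candidates)[1]
```

To use it, save the command's output to a file, read it into a string, and pass the processing date as a UTC datetime, for example `datetime(2024, 8, 5, tzinfo=timezone.utc)`.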
To view a specific versionId of the file:
https://nextstrain-data.s3.amazonaws.com/files/ncov/open/metadata_version.json?versionId=[versionid]
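A minimal sketch of building that URL in code, so that a stored versionId can be turned back into a download link later. The helper name `versioned_url` is hypothetical; the bucket host and key are the ones shown above.

```python
from urllib.parse import quote, urlencode

BUCKET_URL = "https://nextstrain-data.s3.amazonaws.com"
KEY = "files/ncov/open/metadata_version.json"


def versioned_url(version_id: str, key: str = KEY) -> str:
    """Build the public URL for one specific version of an object."""
    # quote() leaves "/" intact; urlencode() escapes the versionId if needed.
    query = urlencode({"versionId": version_id})
    return f"{BUCKET_URL}/{quote(key)}?{query}"
```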
Background
Note: requires completion of #20.
On the date of model input data processing, we want to save some metadata about the current nextclade SARS-CoV-2 dataset (i.e., the most current dataset listed when running nextclade dataset list --name sars-cov-2). We'll need this information when it's time to score the models (~90 days after submission) to ensure that we use the correct reference tree for the clade assignments. This is also important information for reproducibility (e.g., the nextclade version used to generate the data).
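One possible shape for the saved metadata is sketched below. Every field name here is an assumption made for illustration; the issue does not specify a schema, and the values passed in would come from whatever the function in #20 retrieves.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class NextcladeSnapshot:
    """Hypothetical record of the nextclade state on a processing date."""
    processing_date: str      # ISO date of model input data processing
    nextclade_version: str    # nextclade version used to generate the data
    dataset_name: str         # e.g. "sars-cov-2"
    dataset_version: str      # identifier of the dataset that was current


def snapshot_to_json(snapshot: NextcladeSnapshot) -> str:
    """Serialize the snapshot with stable key order so diffs stay readable."""
    return json.dumps(asdict(snapshot), indent=2, sort_keys=True)
```

Keeping the record flat and serializing it with sorted keys makes it easy to compare against the snapshot loaded again at scoring time.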
Definition of done
Update the function created in #20, which retrieves the latest version of the nextclade_metadata.json file (see the related incantations).