nextstrain / ncov-ingest

A pipeline that ingests SARS-CoV-2 (i.e. nCoV) data from GISAID and Genbank, transforms it, stores it on S3, and triggers Nextstrain nCoV rebuilds.
MIT License
36 stars 20 forks source link

feat: switch to nextclade3, by downloading "latest" binary #435

Closed corneliusroemer closed 7 months ago

corneliusroemer commented 7 months ago

Modernize nextclade rule, removing unnecessary args (e.g. --genes) and make it v3 compliant

Necessary to use latest SARS-CoV-2 dataset to get latest lineages

Summary of changes:

Tested with debug profile, but it's a bit tricky in practice, so will do another test run.

Also:

corneliusroemer commented 7 months ago

I'll add new touchfiles so we do a full rerun - it's important we check the live output - it should be ok but just in case.

Ran:

aws s3 cp - s3://nextstrain-ncov-private/nextclade.tsv.zst.renew < /dev/null
aws s3 cp - s3://nextstrain-data/files/ncov/open/nextclade.tsv.zst.renew < /dev/null