seattleflu / id3c-customizations

Extensions of ID3C for the Seattle Flu Study
MIT License
3 stars 0 forks source link

Ingest sequencing accession IDs #387

Closed davereinhart closed 3 weeks ago

davereinhart commented 2 months ago

The clinical data module is being updated to parse sequencing accessioning data for uploading into receiving.clinical, and the clinical ETL is being updated for ingestion into warehouse tables. This data is being sourced from tracking sheets maintained on Github for the Seattle Flu Study.

Running this ETL on receiving.clinical records with document containing gisaid_accession or genbank_accession will result in custom processing for this particular type of data. After matching to an existing sample, a minimal consensus_genome and genomic_sequence record will be generated for each covid-19, RSV-A, RSV-B, Influenza A and Influenza B sequence record.