Bioinformatics workflows for genomic characterization, submission preparation, and genomic epidemiology of viral pathogens of concern, especially SARS-CoV-2
This PR adds a new column "tamiflu_resistance_aa_subs" containing nextclade-detected substitutions that have been described in the literature to confer resistance to tamiflu.
This is currently hard-coded into the nextclade_output_parser_one_sample task in the task_taxonID.wdl file.
The current behaviour of the workflow can be summarized in the following points:
when the "flu" organism is set, the read data is assembled by IRMA which returns both the HA and NA sequence fragments.
abricate returns the appropriate nextclade_ref, nextclade_name and nextclade_ds_tag for both HA and NA depending on the subtype detected
for flu, nexclade runs twice, once for the HA segment and the second time for the NA segment. When running for the NA segment, it compares the detected list of aa substitutions with the list of tamiflu-resistance-associated substitutions, returning the intercept of both lists
Testing
Locally
miniwdl run ~/Git/public_health_viral_genomics/workflows/wf_theiacov_illumina_pe.wdl samplename= BigTest read1_raw= ~/Test/tamiflu_resistance/SRR18273525_1.fastq.gz read2_raw= ~/Test/tamiflu_resistance/SRR18273525_2.fastq.gz organism="flu"
Motivation
This PR adds a new column "tamiflu_resistance_aa_subs" containing nextclade-detected substitutions that have been described in the literature to confer resistance to tamiflu.
The current list of substitutions is as follows:
This is currently hard-coded into the
nextclade_output_parser_one_sample
task in thetask_taxonID.wdl
file. The current behaviour of the workflow can be summarized in the following points:nextclade_ref
,nextclade_name
andnextclade_ds_tag
for both HA and NA depending on the subtype detectedTesting
Locally
miniwdl run ~/Git/public_health_viral_genomics/workflows/wf_theiacov_illumina_pe.wdl samplename= BigTest read1_raw= ~/Test/tamiflu_resistance/SRR18273525_1.fastq.gz read2_raw= ~/Test/tamiflu_resistance/SRR18273525_2.fastq.gz organism="flu"
Terra
Test 1 - Random SRA accessions Test 2 - Samples 01 to 04 of theiacov flu demo dataset