usafsam / mad_river_wf

SARS-CoV-2 analysis workflow, using Nextflow and bbtools
Apache License 2.0
4 stars 2 forks source link

Monkeypox #31

Open fanninpm opened 1 year ago

fanninpm commented 1 year ago

Most of the current workflow should be compatible with the references for hMPXV. However, pangolin seems to be exclusive to SARS-CoV-2.

friesac commented 1 year ago

Yeah, Nextclade should call the clades for us and pangolin can be removed for hmpxv.

fanninpm commented 1 year ago

Code changes to make:

friesac commented 1 year ago
  • modules/local/misc.nf:

    • DATA_PREP:

    • make --schema-name argument customizable

    • in the future, use edirect to download iVar GFF (if the GFF straight from NCBI gives better iVar results than the GFF from nextclade_data)

    • LINEAGE_EXCEL: make pangolin input optional

Yeah, the LINEAGE_EXCEL file would not need to be created, or anything like it, for this initial conversion of taking fastq data and turning it into a consensus fasta.

Can we just ignore edits/improvements to SPIKE_GENE_COVERAGE process while trying to get to hmpxv conversion?

fanninpm commented 1 year ago

As discussed today, we should name these primers something like "hMPXV_YSPH".