TRON-Bioinformatics / covigator-ngs-pipeline

A Nextflow pipeline for NGS variant calling on SARS-CoV-2. From FASTQ files to normalized and annotated VCF files from GATK, BCFtools, LoFreq and iVar.
MIT License
17 stars 7 forks source link

ENH: Add Nextclade for phylogenetic informed QC #55

Open corneliusroemer opened 11 months ago

corneliusroemer commented 11 months ago

Is your feature request related to a problem? Please describe. This looks like a great pipeline that includes a lot of useful tools. I've noticed that you're still adding things to it.

One thing that could make it even better is to integrate Nextclade (disclosure: I work on it)

Besides providing an alternative way to call Pango lineages for SARS-CoV-2, Nextclade generalizes clade calling beyond SARS-CoV-2, e.g. Influenza, mpox, etc.

It also outputs helpful QC metrics of the final consensus genome output:

Nextclade also places a sequence on a phylogenetic tree, which should be appreciated by end users.

Describe the solution you'd like Include Nextclade into the pipeline

Additional context I'd be happy to assist with development of this feature.

priesgo commented 11 months ago

Thanks for your feedback @corneliusroemer , it is a nice compliment coming from someone working in Nextclade!

The QC metrics are very interesting, we will need to understand more of it and think how to deal with it in the pipeline and the dashboard. Also, we have had some performance issues with pangolin when processing a massive number of samples. do you have any comparison of both tools in terms of computational performance?