replikation / poreCov

SARS-CoV-2 workflow for nanopore sequence data
https://case-group.github.io/
GNU General Public License v3.0
40 stars 17 forks source link

Output QC #13

Closed hoelzer closed 3 years ago

hoelzer commented 3 years ago

I think currently there is no quality check of the generated consensuses?

I suggest having at least some simple qc e.g.

to automatically decide which samples yield good enough consensus sequences for further processing.

We could even simply implement a python script that people are working on right now: https://gitlab.com/RKIBioinformaticsPipelines/president

Here, an input FASTA sequence is aligned to the Wuhan reference strain an a pairwise identity is calculated and reported in a tabular format.

replikation commented 3 years ago

agree we can sort the output a bit based on the alignment (wuhan is included here: https://github.com/replikation/poreCov/tree/master/data/reference_nCov19)