cidgoh / nf-ncov-voc

A Nextflow wrapped workflow for generating the mutation profiles of SARS-CoV-2 genomes (Variants of Concern and Variants of Interest). Workflow is developed in collaboration with COVID-MVP (https://github.com/cidgoh/COVID-MVP) which can be used to visualize the mutation profiles and functional annotations.
MIT License
5 stars 5 forks source link

Update scripts to accomodate re-organized pokay #142

Open anwarMZ opened 11 months ago

anwarMZ commented 11 months ago

Is your feature request related to a problem? Please describe. Pokay has been updated to host other priority pathogens in addition to SARS-CoV-2 data. We need to update the script: functional_annotation.py that digests pokay literature and fill in the functional annotation template.

Currently, the script takes two arguments

        description='This script produces a TSV file from TXT files '
                    'in POKAY  '
                    'https://github.com/nodrogluap/pokay/tree/master'
                    '/data for annotating SARS-COV-2 mutations')
    parser.add_argument('--inputdir', type=str, default=None,
                        help='directory path for input files')
    parser.add_argument('--outputfile', type=str, default=None,
                        help='output file (.TSV) format')

Describe the solution you'd like The following adjustments can help:

miseminger commented 10 months ago

Only read CDS_mature_peptides from the JSON for the protein_product