cidgoh / nf-ncov-voc

A Nextflow wrapped workflow for generating the mutation profiles of SARS-CoV-2 genomes (Variants of Concern and Variants of Interest). Workflow is developed in collaboration with COVID-MVP (https://github.com/cidgoh/COVID-MVP) which can be used to visualize the mutation profiles and functional annotations.
MIT License
4 stars 4 forks source link

update the gff2json.py to integrate into nextflow #141

Open anwarMZ opened 9 months ago

anwarMZ commented 9 months ago

Is your feature request related to a problem? Please describe. The gff2json.py script requires a dictionary file. This is a cumbersome step. For example for mpox

{
  "reference": "Nigera-2018",
  "accession": "NC_063383.1",
  "species": "https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=10244",
  "genome": "ACTG"
 }

Describe the solution you'd like This can be removed by providing the

example updated usage - usage: gff2json.py [-h] [--gff_file GFF_FILE] [--ref_fasta FASTA] [--output_json JSON] [optional; --colors_scheme JSON]

anwarMZ commented 9 months ago

Removing the genome key from the output fasta file so we don't need the fasta file as an argument.

anwarMZ commented 9 months ago

For the colors, intergenic is grey by default