vector-engineering / covidcg

A COVID-19 CoV Genetics (CG) browser to inform therapeutics development
https://covidcg.org
MIT License

RSV: mine subtypes from NCBI GenBank #567

Open atc3 opened 1 year ago

atc3 commented 1 year ago

Some sequences that don't include the G gene are still assigned a subtype (A or B) by the submitter, and it's clearly marked under the "source" feature of the "FEATURES" section in the GenBank entry. For example: JF905542

Not sure if we can pull this data from the NCBI Virus API, but if we can, we should look for the submitter-assigned value and substitute it in when our own alignment-based assignment fails.
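
Rough sketch of the fallback, assuming we already have the GenBank flat-file text for a record in hand (this doesn't answer whether the NCBI Virus API exposes it). The function names are made up for illustration, and which qualifier actually holds the subtype is a guess: the code checks `/serotype` and a `subgroup`/`subtype`-style `/note`, and the sample record is hypothetical, not copied from JF905542. We'd need to check real entries before relying on this.

```python
import re
from typing import Dict, Optional

# Assumption: the subtype appears in a /note such as "subgroup: A".
# The keyword is matched case-insensitively, the letter must be uppercase A or B.
NOTE_PATTERN = re.compile(r"\b(?i:subtype|subgroup|group|type)\s*[:=]?\s*([AB])\b")


def parse_source_qualifiers(record_text: str) -> Dict[str, str]:
    """Collect the qualifiers of the 'source' feature from GenBank flat-file text."""
    qualifiers: Dict[str, str] = {}
    in_features = False
    in_source = False
    current_key = None
    for line in record_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if line.startswith("ORIGIN") or line.startswith("//"):
            break
        if not in_features:
            continue
        # Feature keys start in column 6; qualifiers are indented further.
        if line.startswith("     ") and len(line) > 5 and line[5] != " ":
            in_source = line[5:21].strip() == "source"
            current_key = None
            continue
        if not in_source:
            continue
        text = line.strip()
        if text.startswith("/"):
            key, _, value = text[1:].partition("=")
            current_key = key
            qualifiers[key] = value
        elif current_key is not None:
            # Continuation line of a wrapped qualifier value.
            qualifiers[current_key] += " " + text
    return {key: value.strip('"') for key, value in qualifiers.items()}


def submitter_subtype(qualifiers: Dict[str, str]) -> Optional[str]:
    """Return 'A' or 'B' if the submitter recorded one, else None."""
    serotype = qualifiers.get("serotype", "").strip().upper()
    if serotype in ("A", "B"):
        return serotype
    match = NOTE_PATTERN.search(qualifiers.get("note", ""))
    return match.group(1) if match else None


def resolve_subtype(alignment_subtype: Optional[str], record_text: str) -> Optional[str]:
    """Prefer our alignment-based call; fall back to the submitter's value."""
    if alignment_subtype:
        return alignment_subtype
    return submitter_subtype(parse_source_qualifiers(record_text))


# Hypothetical record for illustration only.
SAMPLE_RECORD = """\
LOCUS       EXAMPLE01                900 bp    RNA     linear   VRL 01-JAN-2000
DEFINITION  Hypothetical RSV partial sequence.
FEATURES             Location/Qualifiers
     source          1..900
                     /organism="Human respiratory syncytial virus"
                     /mol_type="viral cRNA"
                     /note="subgroup: A"
     CDS             1..900
                     /note="not a source qualifier; subgroup: B"
ORIGIN
//
"""

if __name__ == "__main__":
    print(resolve_subtype(None, SAMPLE_RECORD))
```

Only qualifiers under `source` are read, so a note on another feature (like the CDS in the sample) is ignored, and the submitter value is used only when our own assignment comes back empty.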