phiweger / zoo

A portable datastructure for rapid prototyping in (viral) bioinformatics (under development).

gff3 + fasta to json #113

Open phiweger opened 7 years ago

phiweger commented 7 years ago

spec: https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md

convert (mgkit):
http://pythonhosted.org/mgkit/scripts/filter-gff.html
http://pythonhosted.org//mgkit/_modules/mgkit/io/gff.html#Annotation.to_json
http://pythonhosted.org//mgkit/_modules/mgkit/io/gff.html#from_gff

less entangled: https://github.com/chapmanb/bcbb/tree/master/gff, with a tutorial at http://biopython.org/wiki/GFF_Parsing

The tutorial above recommends gffutils (https://github.com/daler/gffutils), but it stores its data in SQLite, which is not a good fit here: it adds too much complexity for our scenario.
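
For comparison, a dependency-free conversion might look roughly like the sketch below. It uses only the Python standard library and follows the GFF3 spec linked above (nine tab-separated columns, 1-based inclusive coordinates, URL-escaped `key=value` attributes). The function names and the output layout (`_id`, `sequence`, `annotation`) are assumptions made for illustration, not an existing zoo schema, and the sketch ignores multi-line features and `###` directives.

```python
import json
from urllib.parse import unquote

GFF3_COLUMNS = ("seqid", "source", "type", "start", "end", "score", "strand", "phase")


def parse_attributes(field):
    """Split the ninth GFF3 column into a dict of lists (values may be comma-separated)."""
    attributes = {}
    if field in ("", "."):
        return attributes
    for pair in field.strip().split(";"):
        if not pair:
            continue
        key, _, value = pair.partition("=")
        attributes[unquote(key)] = [unquote(v) for v in value.split(",")]
    return attributes


def parse_gff3(lines):
    """Yield one dict per feature line; stop at an embedded ##FASTA section."""
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("##FASTA"):
            break
        if not line or line.startswith("#"):
            continue  # skip blank lines, comments and directives
        fields = line.split("\t")
        if len(fields) != 9:
            raise ValueError("expected 9 tab-separated columns, got %d" % len(fields))
        record = dict(zip(GFF3_COLUMNS, fields[:8]))
        record["start"] = int(record["start"])  # 1-based, inclusive
        record["end"] = int(record["end"])
        record["attributes"] = parse_attributes(fields[8])
        yield record


def parse_fasta(lines):
    """Return {sequence id: sequence}; the id is the first word of each header."""
    chunks = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            name = header[0] if header else ""
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


def to_json_documents(gff_lines, fasta_lines):
    """Combine sequences and their features into one JSON-ready dict per seqid.

    The document layout is hypothetical, chosen only for this example.
    """
    documents = {
        seqid: {"_id": seqid, "sequence": sequence, "annotation": []}
        for seqid, sequence in parse_fasta(fasta_lines).items()
    }
    for feature in parse_gff3(gff_lines):
        document = documents.setdefault(
            feature["seqid"],
            {"_id": feature["seqid"], "sequence": None, "annotation": []},
        )
        document["annotation"].append(feature)
    return list(documents.values())


if __name__ == "__main__":
    gff = ["##gff-version 3", "chr1\t.\tgene\t1\t9\t.\t+\t.\tID=gene1;Name=foo%20bar"]
    fasta = [">chr1 example", "ACGT", "ACGTA"]
    print(json.dumps(to_json_documents(gff, fasta), indent=2))
```

Keeping everything as plain dicts means the result can be passed straight to `json.dumps` with no intermediate database, which is the complexity the gffutils route would add.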