dib-lab / dammit

just annotate it, dammit!
http://dib-lab.github.io/dammit/

include BUSCO hits in gff3 #65

Open macmanes opened 8 years ago

macmanes commented 8 years ago

Camille,

I want to retain all contigs for which any annotation data exists, including BUSCO hits. This would be easy if the BUSCO results were included in the final gff3 file in some way. What do you think about including them?

camillescott commented 8 years ago

I definitely support it. Do you have ideas on how to represent these in the gff3? The standard really demands having start/end coordinates for each feature; I suppose I could just have the features span the entire transcript (which seems kludgy/inaccurate), or pull coordinates from the BLAST or hmmer outputs. What do you think?

camillescott commented 7 years ago

Been looking into this. The output from BUSCO is very raw: many hits per BUSCO, sometimes multiple per transcript, combined from both hmmer and tblastn. I think the short-term solution for 1.0 will just be to set each feature's start and end to the start and end coordinates of the transcript, and think about dealing with it more intelligently later.
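
For illustration, the whole-transcript approach described above could look roughly like the sketch below. This is not dammit's code: the function name `busco_hit_to_gff3`, the `protein_match` feature type, the `BUSCO` source label, and the attribute layout are all assumptions chosen for the example. The only fixed parts are the nine tab-separated GFF3 columns and the 1-based, inclusive coordinates.

```python
from urllib.parse import quote


def busco_hit_to_gff3(transcript_id, transcript_length, busco_id, status):
    """Format one BUSCO hit as a GFF3 line spanning the whole transcript.

    Hypothetical helper: the names and attribute choices are illustrative only.
    """
    if transcript_length < 1:
        raise ValueError("transcript_length must be a positive integer")

    # GFF3 coordinates are 1-based and inclusive, so a feature covering
    # the entire transcript runs from 1 to the transcript length.
    start, end = 1, transcript_length

    # Percent-encode reserved characters (';', '=', '&', ',') in attribute values.
    def esc(value):
        return quote(str(value), safe=":")

    attributes = ";".join([
        "ID=" + esc(busco_id + ":" + transcript_id),
        "Name=" + esc(busco_id),
        "Note=" + esc("status:" + status),
    ])

    columns = [
        transcript_id,     # seqid
        "BUSCO",           # source (assumed label)
        "protein_match",   # type (assumed Sequence Ontology term)
        str(start),
        str(end),
        ".",               # score: not carried over in this sketch
        ".",               # strand: unknown without alignment coordinates
        ".",               # phase: only meaningful for CDS features
        attributes,
    ]
    return "\t".join(columns)


if __name__ == "__main__":
    # Made-up identifiers, for demonstration only.
    print(busco_hit_to_gff3("Transcript_1", 1500, "EXAMPLE_BUSCO_ID", "Complete"))
```

Pulling real coordinates from the hmmer or tblastn output later would only change how `start`, `end`, and possibly the strand column are filled in; the rest of the line would stay the same.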