Closed JC-therea closed 2 years ago
It is most likely failing on trying to compare your original GFF annotation and the GBK "updated" annotation. The first GFF3 parser likely skipped a bunch of annotations it didn't understand, but perhaps one or many of those got through to the compareannotations2 function, ie:
[Jun 03 04:36 PM]: Previous annotation consists of: 5,132 protein coding gene models and 7,592 non-coding gene models
So I'm wondering if there is something off with the GFF3 file here, ie 7500 non-coding gene models???
Yeah... my model has a lot (I don't know if all of them are correct) of ncRNAs... Maybe if I want to improve the UTR annotation the best way is just to remove the ncRNAs from the original GFF file and keep only the mRNAs. What do you think?
Thank you for your help
Finally, I observed that features like ncRNA or tRNA have the field "CDS" instead of "exon". That was probably the problem with my annotation. Thank you very much!
Glad you got it figured out! GFF is finicky format.
Hello, I am using funannotate update to have a more accurate UTR regions from a model organism using dRNA reads. It seems that does have a problem parsing the GenBank files that funannotate creates:
Command:
output:
The final lines of the logFile provide very similar information:
I will appreciate any help. O observed a similar question here but in my case, there is not any gene indicated.
Thank you very much for your help.