althonos / pyrodigal

Cython bindings and Python interface to Prodigal, an ORF finder for genomes and metagenomes. Now with SIMD!
https://pyrodigal.readthedocs.org
GNU General Public License v3.0
141 stars 5 forks source link

different result comparing to prodigal #42

Open zzsunday opened 1 year ago

zzsunday commented 1 year ago

hello, something strange happens. when I use prokka, which uses prodigal to predict CDS,(prokka version 1.14.6) to annotate, the result showed that the position from 49243 to 50055 is entire CDS. but, when I using bakta, which uses pyrodigal to predict cds( bakta version 1.7.0) , it truncate the entire CDS into two different CDS. I also do it by blast, the result showed that the 49243 to 50055 100% match to a specific gene.

attached files is : prokka.gff : prokka result bakta.gff : bakta result ndm5.out: blastn result 19D44.fasta: sequence to annotate(query sequence) ndm5.fasta: the ndm5 gene sequence Archive.zip

althonos commented 1 year ago

Hi @zzsunday, Pyrodigal and Prodigal are expected to have different results unless you compiled Prodigal from source, because Pyrodigal fixed several bugs in Prodigal that affect the gene scoring. But otherwise, it was tested extensively before being integrated in bakta (see #21).

zzsunday commented 1 year ago

Thank you @althonos, but another question occured, when i used spades to do assembly, then use bakta to annotate. and I also extracted one specific contig named contig40 (the contig name in spades result was [Uploading Archive.zip…]() NODE_40_length_21390_cov_105.006) from spades assembly result and used bakta to annotate. interestingly, I got two different cds prediction to the same sequence( from position 5813 to 9110). attached file: 101.fasta: spades assembly result 101.gff3: spades assembly result annotated by bakta contig40.fasta(the name in spades assembly result was NODE_40_length_21390_cov_105.006): single contig extracted from spades assembly result contig40.gff3: single contig annotated by bakta

althonos commented 1 year ago

Were you running in meta or single mode?