oushujun / EDTA

Extensive de-novo TE Annotator
https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1905-y
GNU General Public License v3.0

Difference in metrics between sum and gff3 files #331

Closed clauderispe closed 1 year ago

clauderispe commented 1 year ago

Dear Shujun,

I noticed a difference between the summary metrics reported in genome.mod.EDTA.TEanno.sum and the contents of genome.EDTA.TEanno.gff3. The sum file reports a total count of 3184990 elements, while the gff3 has 3939911 lines. For example, the sum file reports 3239 Copia elements, whereas 3785 lines in the gff3 contain "Copia". Is there an explanation for this pattern? Could it be related to the error messages I reported in a previous issue (#328: the program runs to the end but reports errors for call_seq_by_list2.pl and oom-kill event(s))?

Thanks for your feedback! Claude

oushujun commented 1 year ago

Hi Claude,

Structurally intact elements have multiple lines in the gff3, one for each feature, so the number of gff3 lines can exceed the number of elements. Please find more information on the wiki page.

Best, Shujun
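
To see how multi-line features can inflate a plain line count, here is a minimal Python sketch that compares the number of gff3 lines with the number of top-level features. It assumes that sub-features of an intact element carry a `Parent` attribute, as the GFF3 format allows; the function name `count_gff3` and the toy records (feature types, IDs, coordinates) are hypothetical and not taken from EDTA output. The top-level count is not guaranteed to match the `.sum` file exactly, since that file may be computed differently.

```python
import io
from collections import Counter


def count_gff3(handle, keyword=None):
    """Return (feature_lines, top_level_features) for a GFF3 stream.

    keyword: if given, only lines containing this string are counted,
    mimicking a simple text search such as grep "Copia".
    """
    counts = Counter()
    for line in handle:
        # Skip blank lines and comment/directive lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9:  # GFF3 feature lines have nine tab-separated columns
            continue
        if keyword is not None and keyword not in line:
            continue
        counts["lines"] += 1
        # Column 9 holds semicolon-separated key=value attributes.
        attrs = dict(
            field.split("=", 1) for field in cols[8].split(";") if "=" in field
        )
        # Assumption: sub-features of an intact element reference it via Parent.
        if "Parent" not in attrs:
            counts["top_level"] += 1
    return counts["lines"], counts["top_level"]


# Toy records (hypothetical): one intact element with two sub-features,
# plus one single-line fragment.
EXAMPLE = "\n".join([
    "##gff-version 3",
    "chr1\ttoy\trepeat_region\t100\t5000\t.\t+\t.\tID=r1;Classification=LTR/Copia",
    "chr1\ttoy\tsub_feature_a\t100\t104\t.\t+\t.\tID=a1;Parent=r1;Classification=LTR/Copia",
    "chr1\ttoy\tsub_feature_b\t105\t400\t.\t+\t.\tID=b1;Parent=r1;Classification=LTR/Copia",
    "chr1\ttoy\tfragment\t6000\t6500\t.\t-\t.\tID=f1;Classification=LTR/Copia",
]) + "\n"

lines, top_level = count_gff3(io.StringIO(EXAMPLE), keyword="Copia")
print(lines, top_level)
```

On a real annotation, the same function could be called with `open("genome.EDTA.TEanno.gff3")` in place of the `io.StringIO` object.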
