Closed bramvandijk88 closed 3 years ago
Hello,
this is the same issue as in issues #85 and #70. You can read from https://github.com/gem-pasteur/Integron_Finder/issues/85#issuecomment-839942307
Long story short, it is fixed if you install IntegronFinder from the master branch and not from the release candidate 2.0rc6.
See here how to do that : https://github.com/gem-pasteur/Integron_Finder/issues/85#issuecomment-850339451
Hi again @bramvandijk88 ,
Concerning your question :
Especially the third column seems to confuse me, which appears to contain the names of other contigs in the same file? What does that mean? Is the evalue column related to that?
The third column element
is the main column of this table (b/c this table has one element per line). An element can be a protein, an attC site, an attI site, a promoteur, etc... Proteins identifiers are those produced by prodigal, and are basically ID_contig_1
with 1 being the protein numbering done by prodigal along the sequence. Because of the bug, you have proteins from other contigs that are aggregated with the contigs where there is an integron, hence your confusion.
The evalue is given for attC or intI, or for protein when using functional annotation, others have NA
.
I hope it clears your confusion up ! Best
Version of Integron_Finder:
OS
I need some help making sense of the output of IF. I'm simply not sure what I'm looking at, and failed to find documentation on what each column means. I ran integrond finder (see command below) on a simple multi-fasta file, which is a metagenomic bin derived from a compost sample (Thiopseudomonas denitrificans). I have many other bins, but this was one of the better ones (checkm reports it as 100% complete and 0.0% contaminated).
Alright, here's the first 10 lines of the integron_finder_summary:
This reports the integron found on contig "NODE_8" is complete. So then getting the (first 12) lines matching that integron from the output of integron_finder_results:
For some of the columns, it is rather obvious what they mean (ID_integron, type_elt, distance_2attC, strand), but for many other I have no clue. Especially the third column seems to confuse me, which appears to contain the names of other contigs in the same file? What does that mean? Is the evalue column related to that?
Thanks for the help!
Best,
Bram