Closed antgonza closed 2 years ago
This is expected behavior. The taxonomy ID reported in the output is of the Blastomyces genus.
The NCBI taxonomy is hierarchical, and the species taxIDs are children of genus taxIDs. Since there is conflicting evidence at the species level about which Blastomyces is present, eukdetect does not report a specific species but rather considers there to be enough evidence that there is something in the Blastomyces genus present.
More info is in the methods section of the paper.
Thank you for the explanation, I think it makes sense.
Now, is there an easy way to reconcile those values without having to go to https://www.ncbi.nlm.nih.gov/Taxonomy/Browser? I guess something similar to marker_genes_per_species.csv
but for all taxonomic levels?
Take a look at the output in the {samplename}_filtered_hits_taxonomy.txt files - those provide the markers observed, number of reads observed, and other metrics for every taxonomic level above everything reported in the final filtered output.
Hello,
I ran my samples through the EukDetect pipeline using the latests bowtie2/taxonomy files and everything went fine - thank you for making it so easy to run.
Anyway, I was looking to reconcile the
*_filtered_hits_table.txt
with the taxonomies found inmarker_genes_per_species.csv
and realized that at first glance my files are missing a few Taxid:5579
,5658
,29901
,229219
. Is this expected and/or am I missing something?Just to give a little more details about this issue, one of my tables has these info:
and when I look for
229219
inmarker_genes_per_species.csv
nothing is found and where I look forBlastomyces
, I actually found these but that Taxid is not present:another example
29901
, in one of my tables:but not found:
Thank you!