Closed Aiswarya-prasad closed 3 years ago
Which centrifuge index are you using?
It's the p_compressed+h+v index from
ftp://ftp.ccb.jhu.edu/pub/infphilo/centrifuge/data/p_compressed+h+v.tar.gz
That index might be outdated. Can you try the newer one created by other researchers such as: https://zenodo.org/record/3732127/files/h+p+v+c.tar.gz?download=1 ?
That index might be outdated. Can you try the newer one created by other researchers such as: https://zenodo.org/record/3732127/files/h+p+v+c.tar.gz?download=1 ?
I will try this. Thank you. Where can I find more information about this index?
I found an important taxon (853) that is widely reported in many studies to be missing in the Centrifuge report.
In the output, those reads that Kraken2 had classified as taxID 853, are classified (2176/4994 classified by Kraken2) as seen below:
I could not find this taxon (853) in the database by using centrifuge-inspect and grep on the output.
These matches seem to have a good score and hitLength but do not correlate with Kraken2. Does this mean that they should be disregarded? I understand that it may not be easy to compare two tools like this especially since different databases are involved but this makes leaves me at a tough spot where I am unable to decide which results to go with especially since I know that this taxon has been widely reported by many 16S rRNA based studies (mine is nanopore shotgun data).
Also, this makes me worry that this may be happening with other taxons too.
This is an issue with centrifuge-1.0.3-beta.