mourisl / centrifuger

Classifier for metagenomic sequences using FM-index with run-block compressed BWT.
MIT License
39 stars 3 forks source link

pre-built index files issue #14

Closed Wang-Jing-NZ closed 1 day ago

Wang-Jing-NZ commented 2 days ago

We downloaded the pre-compiled index file and tried to run with the files in the example folder. It did not produce expected results, an empty output file. We also tried with our own data, Illumina and ONT, still same empty outputs. We don't see any error messages. Can you get some help on this? Any ideas on the possible cause of this issue? Many thanks.

mourisl commented 1 day ago

What was your running command? Which index are you using, refseq or gtdb?

Wang-Jing-NZ commented 1 day ago

the pre-compile index is the one from Zenodo, https://zenodo.org/records/10023239. centrifuger 1.0.4 is loaded as a module the cmd: module load centrifuger/1.0.4 centrifuger -1 example_1.fq -2 example_2.fq -x /share/databases/centrifuger/db/cfr_hpv+gbsarscov2 > example.tsv many thanks!

mourisl commented 1 day ago

What is the output on the screen, Like finishing load of index? How much memory did you allocate for this job? You may need at least about 50G memory to classify against this database. If it is not the memory issue, can you use centrifuger-inspect (if it is in the module) with --summary to see whether it can read in the index file?

Wang-Jing-NZ commented 1 day ago

no, I also tried with reference provided in the example folder and build the index from it. It works. centrifuger -1 example_1.fq -2 example_2.fq -x cfr_ref_idx > example_example_index.tsv [Tue Sep 17 13:13:03 2024] Centrifuger v1.0.4-r153 starts. [Tue Sep 17 13:13:03 2024] Finishes loading index. [Tue Sep 17 13:13:03 2024] Inferred --min-hitlen: 23 [Tue Sep 17 13:13:03 2024] Centrifuger finishes. use the command above, just one line output, [Tue Sep 17 13:19:37 2024] Centrifuger v1.0.4-r153 starts. -only one line of the output, no msg for loading the index. (copied outputs from yersterday's chat with our Linux administrator who has been helped me to get this up running, )

mourisl commented 1 day ago

I think the index loading step failed. This could happen due to not enough memory is allocated and the job is killed.

Wang-Jing-NZ commented 1 day ago

thanks a lot! we just tried all centrifuger-inspect commands, no outputs from any of the cmd, i.e., centrifuger-inspect --summary -x share/databases/centrifuger/db/cfr_hpv+gbsarscov2 how to set memory in centrifuger? Our Linux administrator noticed it in centrifuger-build.Here is a prebuilt index, how do set it when run centrifuger ?

mourisl commented 1 day ago

Hmmm, the memory usage should be quite low when running centrifuger-inspect with --summary. So the issue probably is not relate to memory. Can you show me the sizes of the index files you downloaded?

Wang-Jing-NZ commented 1 day ago

total 48044504 -rw-r--r-- 1 root root 45035606741 Sep 16 04:55 cfr_hpv+gbsarscov2.1.cfr -rw-r--r-- 1 root root 5054329 Sep 16 07:37 cfr_hpv+gbsarscov2.2.cfr -rw-r--r-- 1 root root 1514688 Sep 16 07:37 cfr_hpv+gbsarscov2.3.cfr does it help?

mourisl commented 1 day ago

The files look fine to me. This is very strange.

we just tried all centrifuger-inspect commands, no outputs from any of the cmd, i.e., centrifuger-inspect --summary -x share/databases/centrifuger/db/cfr_hpv+gbsarscov2`

The "share/databases/centrifuger/db/cfr_hpv+gbsarscov2" misses a "/" in the beginning. Is it a typo here or a typo when you ran centrifuger-inspect?

russellsmithies commented 1 day ago

gbsarscov2" misses a "/" in the beginning. Is it a typo here or a typo when you ran centrifuger-inspect?

All sorted - we were running it as a conda env inside a singularity container and the wrapper script had a symlink in it that wasn't being followed.

--Russell (Linux admin)

Wang-Jing-NZ commented 1 day ago

oh, a typo, sorry, Not the path I used to run the cmd.

Wang-Jing-NZ commented 1 day ago

thank you both for all the help here. Really appreciated!

mourisl commented 1 day ago

I'll close this issue for now. Feel free to reopen it or create another thread if you find any issues with Centrifuger.