Closed lakhujanivijay closed 6 years ago
Looks like the downloads of the files from the NCBI FTP server were interrupted, thus they cannot be unpacked properly.
The file sizes should be around 3.6G for prot.accession2taxid.gz
and 36G for nr.gz
.
Does that mean a "network hiccup" ?
I guess you could call it like that if you want to.. :) I have never seen those lines
2018-05-31 16:38:36 (69388554 GB/s) - Control connection closed.
So I assume it has something to do with your network connectivity..
HAHAHA :+1: I liked that comment ...!!
I will check my internet. Thank you so much Peter! By the way, I liked the tool
Nice to hear!
Hi Peter
It's me once again to bug you! I did not want to open a new issue , hence, posting here.
I was finally able to download the data , however, I am not able to find the .fmi
file. My folder looks like
citations.dmp
convert_mar_to_kaiju.py
delnodes.dmp
division.dmp
gbk2faa.pl
gc.prt
gencode.dmp
LICENSE
makeDB.sh
merged.dmp
names.dmp
nodes.dmp
nr.gz
prot.accession2taxid
prot.accession2taxid.gz
README.md
readme.txt
taxdump.tar.gz
taxonlist.tsv
The database size (as you mentioned above matches)
[corona] $ du -sh prot.accession2taxid.gz
3.6G prot.accession2taxid.gz
[corona]$ du -sh nr.gz
36G nr.gz
Appreciate your time! Let me know if this is not the appropriate place to ask such questions; do suggest an alternative in that case.
Regards Vijay
Ok, so you ran makeDB.sh -e
and it aborted so you didn't get the kaiju_db_nr_euk.fmi file? Then probably you need more RAM to complete it. It's a very big database, so probably needs 100GB RAM.
You can also download it from here: http://kaiju.binf.ku.dk/server (blue box)
Hi Peter
I hope you are doing well. I have downloaded the file you suggested above from this the link.
Here are the data sizes both files , compressed and extracted ones!
[headnode new]$ du -sh *
49G kaiju_index_nr_euk
28G kaiju_index_nr_euk.gz
After extraction, what I get a single binary file; however, I thought I will get a folder with few files, .dmp
and .fmi
files. Could you please help?
Regards Vijay Lakhujani
It looks like you unpacked the file with the wrong command. You need to use:
tar xzf kaiju_index_nr_euk.tgz
which will give you the files
kaiju_index_nr_euk.fmi
names.dmp
nodes.dmp
Oh! How foolish of me. Indeed I made a mistake!
Thanks man, you are great..!! Let me try running it now.
Hi
I am trying to download the reference set data for standalone KAIJU by following the steps on this link
My command is
However, after waiting for a long time, I am getting below error: