Malfoy / PAC

GNU General Public License v3.0
10 stars 3 forks source link

Warning the existing index use a 16 bits encoding. Input encoding ignored #5

Open shenwei356 opened 7 months ago

shenwei356 commented 7 months ago

Hi @Malfoy and @kamimrcht , I just want to have a quick test, but the querying did not work.

version: latest 940f18b.

Installation:

mamba create -n pac
mamba activate pac

mamba install -c conda-forge cxx-compiler zlib

git clone --depth 1 --recursive https://github.com/Malfoy/PAC.git
cd PAC

# manually edit file: zstr.hpp.
# change
#    #include <zlib.h>
# to
#    #include </homes/shenwei/.conda/envs/pac/include/zlib.h>

make -j 8

Indexing with 100 bacterial genomes.

PAC  -f <(fd gz$ test1k/) -d t_pac.1k -k 21 -c 16
************** PAC  ***************

I index 31mers with Bloom filters of size 134,217,728 with 1 hash functions  using 256 partitions 
Insert file of file /dev/fd/63
100 files found 
Bloom construction time: 7.39291s
1,424 MB RAM used
Exponential Bloom construction time: 0.500061s
Total Index time: 7.89297s
1,439 MB RAM used
Dump the index in /hps/nobackup/iqbal/shenwei/db/gtdb/t_pac.100

Querying with one of them.

$ PAC -l t_pac.100/ -q  <(fd gz$ test100/ | head -n 1)  -c 16 -o t.txt
************** PAC  ***************

Warning the existing index use a 16 bits encoding. Input encoding ignored

$ ls t.txt
ls: cannot access 't.txt': No such file or directory
Malfoy commented 6 months ago

Hi @shenwei356 ! Sorry it was a small parsing bug, it should work fine now!