build_db: compact hash table capacity exceeded

Hello

I am running into this error, I googled it and no-one else seems to have run into it, so I thought I would raise an issue.

I am building a very small test database, but crucially I have edited (i.e. added) additional rows to the names.dmp and nodes.dmp file, because I want to use the NCBI taxonomy, but add some custom species to it.

The command is simply:

kraken2-build --threads 16 --build --db test

The output is:

Creating sequence ID to taxonomy ID map (step 1)...
Sequence ID to taxonomy ID map already present, skipping map creation.
Estimating required capacity (step 2)...
Estimated hash table requirement: 10240 bytes
Capacity estimation complete. [0.017s]
Building database files (step 3)...
Taxonomy parsed and converted.
CHT created with 4 bits reserved for taxid.
build_db: compact hash table capacity exceeded

Within test both seqid2taxid.map and taxo.k2d.tmp have at least begun to be created. Within test/taxonomy then prelim_map.txt has been created

Of course I have no idea if the error message is related to my editing of nodes.dmp and names.dmp!

I am building on a server with 16x16Gb configuration.

Kraken2 installed from bioconda

Cheers Mick

DerrickWood / kraken2

build_db: compact hash table capacity exceeded #321