Closed by Cscharlotte 2 weeks ago
hi @Cscharlotte, thanks for your interest in the package!
The issue you are encountering is due to an old release of igblast (and likely blast too) on your system. c_region_db was added in igblast v1.18.0, ~3 years ago: https://ncbi.github.io/igblast/rel/Release-notes.html
The latest versions of igblast and blast are 1.22.0 and 2.16.0+ respectively. Just update your igblast and blast and they should hopefully work?
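To confirm which versions are actually picked up from the PATH, something like the sketch below may help. It assumes both tools accept the standard BLAST+ `-version` flag and print a line such as `igblastn: 1.22.0`; the helper names (`parse_version`, `tool_version`, `supports_c_region_db`) are made up for illustration and are not part of dandelion.

```python
import re
import shutil
import subprocess


def parse_version(text):
    """Return the first dotted version number found in text as a tuple of ints."""
    match = re.search(r"(\d+(?:\.\d+)+)", text)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def tool_version(executable):
    """Run `<executable> -version` and parse the output; None if the tool is not on PATH."""
    if shutil.which(executable) is None:
        return None
    # Assumption: the tool prints its version to stdout when given -version.
    result = subprocess.run([executable, "-version"], capture_output=True, text=True)
    return parse_version(result.stdout)


def supports_c_region_db(igblast_version):
    """c_region_db was added in igblast v1.18.0, per the reply above."""
    return igblast_version is not None and igblast_version >= (1, 18, 0)


if __name__ == "__main__":
    for tool in ("igblastn", "blastn"):
        print(tool, tool_version(tool))
    print("c_region_db supported:", supports_c_region_db(tool_version("igblastn")))
```

If `tool_version("igblastn")` reports something older than 1.18.0, or a different version than the one you just installed, an older copy is probably shadowing the new one on the PATH.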
I have upgraded both and the error above is resolved, but I am now getting the errors below:
Assigning genes : 0%| | 0/1 [00:00<?, ?it/s]
BLAST query/options error: Germline annotation database human/human_TR_V could not be found in [internal_data] directory
Please refer to the BLAST+ user manual.
BLAST query/options error: Germline annotation database human/human_TR_V could not be found in [internal_data] directory
Please refer to the BLAST+ user manual.
ERROR> Input 10kPBMC_CFCOV/dandelion/tmp/all_contig_igblast.fmt7 does not exist.
ERROR> Input 10kPBMC_CFCOV/dandelion/tmp/all_contig_igblast.fmt7 does not exist.
BLAST Database error: Database memory map file error
BLAST Database error: Database memory map file error
Assigning genes : 100%|██████████| 1/1 [00:09<00:00, 9.30s/it]
Traceback (most recent call last):
File "
I have noticed similar issues mentioned in the tracer repo and, more recently, in #382. I tried setting the igblast_db parameter to either the internal_data folder in the container folder or to home/miniconda3/pkgs/igblast-1.22.0-h6a68c12_1/bin, but neither worked. Could you provide detailed guidance on how to solve this problem? Thanks.
can you try something like:
ddl.pp.reannotate_genes(
    folder,
    igblast_db="path/to/database/igblast",
    germline="path/to/database/germlines/imgt/human/vdj",
)
one more thing to add: if it still complains that the indexes are outdated, you might want to use the container script to prepare the database: https://github.com/zktuong/dandelion/blob/master/container/scripts/prepare_imgt_database.py
or ideally, if you can run the singularity container, all of this becomes trivial - it doesn't need anndata for the preprocessing
Defining the germline path unfortunately did not work for me, so I re-read the replies in https://github.com/Teichlab/tracer/issues/48. The solution that works for me now is to download the internal_data folder from NCBI and put it in the /bin folder of the igblast executable. After doing this, the following command works: ddl.pp.reannotate_genes(samples, loci="tr", reassign_dj=True, filename_prefix="all", igblast_db='home/dandelion/Github/dandelion/container/database/igblast').
I suspect the issue was caused by the missing internal_data folder in the igblast executable directory. This folder normally seems to be included with the igblast installation from version 1.13.0 onwards. However, my conda installation of igblast (even though it is version 1.22+) did not include this folder for some reason, which likely caused the problem. Anyway, thank you for taking the time to address all the issues!
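For anyone hitting the same thing, a rough way to check for the missing folder before running reannotate_genes is sketched below. It assumes igblast may look for internal_data under the IGDATA environment variable, next to the igblastn executable (the location that worked above), or in the current working directory; `candidate_dirs` and `find_internal_data` are hypothetical helper names, not dandelion or igblast functions.

```python
import os
import shutil
from pathlib import Path


def candidate_dirs(executable="igblastn"):
    """Directories to inspect: $IGDATA (if set), the executable's folder, and the cwd."""
    dirs = []
    igdata = os.environ.get("IGDATA")
    if igdata:
        dirs.append(Path(igdata))
    exe = shutil.which(executable)
    if exe:
        # Folder that holds the igblastn binary found on PATH.
        dirs.append(Path(exe).parent)
    dirs.append(Path.cwd())
    return dirs


def find_internal_data(dirs):
    """Return the first existing internal_data directory under dirs, or None."""
    for directory in dirs:
        candidate = Path(directory) / "internal_data"
        if candidate.is_dir():
            return candidate
    return None


if __name__ == "__main__":
    found = find_internal_data(candidate_dirs())
    print("internal_data found at:", found)
```

If this prints `None`, the conda package most likely shipped without the folder, which matches the "could not be found in [internal_data] directory" error earlier in the thread.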
glad that it worked out eventually!
Description of the bug
Hi,
Thank you for developing the dandelion package! I am trying to reannotate my 10x TCR data (only one dataset, for testing), but an error was raised whose cause I cannot identify. The code is shown below. My sample folder is saved as '~/dandelion/dandelion_data/10kPBMC_CFCOV' and contains 'all_contig_annotations.csv' and 'all_contig.fasta'.
The dandelion version I am using is the current GitHub development version, to avoid the conflict with anndata in the 0.3.8 release.
Minimal reproducible example
The error message produced by the code above
OS information
Linux
Version information
dandelion==0.3.9.dev4 pandas==2.2.2 numpy==1.26.4 matplotlib==3.9.2 networkx==2.7 scipy==1.14.1
Additional context
I retried in a new window and, instead of the errors with asterisks above, I now get the error below. It seems like a confusion between blast and igblast?
BLAST Database error: No alias or index file found for nucleotide database [home/dandelion/Github/dandelion/container/database/igblast/database/imgt_human_tr_j] in search path [home/dandelion:home/dandelion/Github/dandelion/container/database/blast:]
BLAST Database error: No alias or index file found for nucleotide database [home/dandelion/Github/dandelion/container/database/igblast/database/imgt_human_tr_d] in search path [home/dandelion:home/dandelion/Github/dandelion/container/database/blast:]
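The "No alias or index file found" message suggests BLAST cannot find a `.nal` (alias) or `.nin` (index) file for the given database prefix. A minimal sketch for checking this is below. The function names are hypothetical, and the database names are simply the two that appear in the error above.

```python
from pathlib import Path

# Suffixes that mark a nucleotide BLAST database: .nin is the index, .nal an alias file.
NUCLEOTIDE_MARKERS = (".nin", ".nal")


def has_nucleotide_db(prefix):
    """Return True if an index or alias file exists for the database prefix."""
    prefix = Path(prefix)
    return any(
        prefix.with_name(prefix.name + suffix).is_file()
        for suffix in NUCLEOTIDE_MARKERS
    )


def missing_databases(db_dir, names):
    """Return the database names under db_dir that lack an index or alias file."""
    return [name for name in names if not has_nucleotide_db(Path(db_dir) / name)]


if __name__ == "__main__":
    # Directory and names taken from the error message; adjust to your own setup.
    db_dir = "home/dandelion/Github/dandelion/container/database/igblast/database"
    print(missing_databases(db_dir, ["imgt_human_tr_j", "imgt_human_tr_d"]))
```

Any name this reports as missing either was never indexed with makeblastdb or lives in a different directory than the one being searched. Note that the path in the error has no leading slash, so it is resolved relative to the working directory.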