aldertzomer / vibrio_parahaemolyticus_genomoserotyping

Kaptive database of O and K antigen clusters of Vibrio parahaemolyticus
GNU General Public License v3.0
7 stars 2 forks source link

Does not work with current verison of kaptive #3

Open erinyoung opened 3 months ago

erinyoung commented 3 months ago

I was updating kaptive to version 3.0.0b1, and noticed that this version doesn't appear to work with these gbk files anymore.

What I ran:

wget -q https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/558/495/GCF_001558495.2_ASM155849v2/GCF_001558495.2_ASM155849v2_genomic.fna.gz
wget -q https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/728/135/GCF_001728135.1_ASM172813v1/GCF_001728135.1_ASM172813v1_genomic.fna.gz
gzip -d GCF_001558495.2_ASM155849v2_genomic.fna.gz
gzip -d GCF_001728135.1_ASM172813v1_genomic.fna.gz
kaptive assembly VibrioPara_Kaptivedb_K.gbk *.fna -o /test3/Vparahaemolyticus 

The error I observed:

Traceback (most recent call last):
   File "/usr/local/bin/kaptive", line 8, in <module>
     sys.exit(main())
   File "/usr/local/lib/python3.9/site-packages/kaptive/__main__.py", line 293, in main
     if (result := typing_pipeline(
   File "/usr/local/lib/python3.9/site-packages/kaptive/assembly.py", line 163, in typing_pipeline
     for_scoring = [(
   File "/usr/local/lib/python3.9/site-packages/kaptive/assembly.py", line 164, in <listcomp>
     db.loci[locus], list(a)) for locus, a in groupby(  # Group alignments by locus
 KeyError: 'KL155-1'

Are you going to be updating your files?

aldertzomer commented 3 months ago

@tomdstanton I think the Kaptive database format has changed for Kaptive3. Can you tell use the necessary changes to make it work again?

tomdstanton commented 3 months ago

Hi both, This shouldn't be due to the v3 update, but I think is due to the issue flagged in #1. See my fix in PR #4

aldertzomer commented 3 months ago

ah I missed the fix. I just uploaded a fixed version with KL155-1 instead of KL1155-1