mgalardini / pyseer

SEER, reimplemented in python 🐍🔮
http://pyseer.readthedocs.io
Apache License 2.0
104 stars 25 forks source link

IndexError-list index out of range #145

Closed buihoangphuc412 closed 3 years ago

buihoangphuc412 commented 3 years ago

I ran this command annotate_hits_pyseer significant_kmers_amox.txt references.txt annotated_kmers_amox.txt and get error: 100 kmers remain Reference 1 [main] Version: 0.7.17-r1188 [main] CMD: bwa mem -v 1 -k 8 CHC97.fasta /media/sf_Biolinux/Antibiotic project/pyseer/remaining_kmers.fa [main] Real time: 0.125 sec; CPU: 0.114 sec Traceback (most recent call last): File "/home/manager/miniconda3/envs/test/bin/annotate_hits_pyseer", line 10, in sys.exit(main()) File "/home/manager/miniconda3/envs/test/lib/python3.7/site-packages/pyseer/kmer_mapping/annotate_hits.py", line 179, in main up_genes = extract_genes(sorted_query.closest(b=ref_annotation, s=False, D="ref", iu=True, stream=True)) File "/home/manager/miniconda3/envs/test/lib/python3.7/site-packages/pyseer/kmer_mapping/annotate_hits.py", line 52, in extract_genes for tag in match.fields[15].split(";"): IndexError: list index out of range

I attached files for running references.txt significant_kmers_amox.txt

johnlees commented 3 years ago

@mgalardini The range error seems common in this script, whenever bwa gives something with surprising formatting. Maybe we should put it in a try/catch?

mgalardini commented 3 years ago

Agreed! Will try to implement something in the next couple of days, we are due a new version as well I think

mgalardini commented 3 years ago

@buihoangphuc412 would you mind sharing the fasta and gff file for sample CHC97?

Thanks

buihoangphuc412 commented 3 years ago

I attached the files. Thanks

Vào Th 3, 27 thg 4, 2021 vào lúc 19:14 Marco Galardini < @.***> đã viết:

@buihoangphuc412 https://github.com/buihoangphuc412 would you mind sharing the fasta and gff file for sample CHC97?

Thanks

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/mgalardini/pyseer/issues/145#issuecomment-827490167, or unsubscribe https://github.com/notifications/unsubscribe-auth/AITHPCS5TJN3GJUVKSUABE3TK2FATANCNFSM43KKS3OQ .

mgalardini commented 3 years ago

Sorry, I don't see them, you may need to attach them by replying to the message using Github's interface

buihoangphuc412 commented 3 years ago

CHC97.zip

mgalardini commented 3 years ago

Which version of pyseer are you using? I can't reproduce the error using pyseer v1.3.4 or from the latest commit

buihoangphuc412 commented 3 years ago

I used the new version pyseer --version pyseer 1.3.6

mgalardini commented 3 years ago

Tried that too, but I'm not encountering any error. Let's make sure we have no other version mismatch; here's the version of the other relevant libraries/software:

My output (truncated before it fails because I don't have the other references):

~/workspace/pyseer/issue_145$ annotate_hits_pyseer significant_kmers_amox.txt references.txt output
100 kmers remain                                                                                         
Reference 1                                                                                              
[main] Version: 0.7.17-r1188                                                                             
[main] CMD: bwa mem -v 1 -k 8 CHC97.fasta /home/m/workspace/pyseer/issue_145/remaining_kmers.fa
[main] Real time: 0.060 sec; CPU: 0.061 sec
[M::bwa_idx_load_from_disk] read 0 ALT contigs                                                                                                                                                                     
[main] Version: 0.7.17-r1188
[main] CMD: bwa fastmap -w 100 -l 9 CHC97.fasta /home/m/workspace/pyseer/issue_145/remaining_kmers.fa
[main] Real time: 0.008 sec; CPU: 0.010 sec                                                              
71 kmers remain
Draft reference 2
mgalardini commented 3 years ago

Closing for lack of follow-up replies and could not reproduce