Open acvill opened 1 month ago
Hi @acvill ,
This is a very weird phage! Essentially, the error occurs because Phold (via Foldseek) found 0 hits. My suspicion is that this is due to phanotate providing crappy gene calls. There were 179 CDS in your GenBank, whereas the paper, which used prokka (prodigal), reported only 90 (https://journals.asm.org/doi/10.1128/spectrum.03719-23). I will look into this separately for other reasons, because that is bizarre!
In the dev branch, I have added a line of code that warns the user and exits if Foldseek finds 0 hits, an unlikely occurrence but nonetheless possible.
George
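For readers hitting the same error, the kind of guard described above could look roughly like the sketch below. This is not the actual phold code: the function names, the logger name, and the assumption that the Foldseek results are in a tab-separated file with one row per hit are all hypothetical and chosen for illustration only.

```python
import logging
import sys
from pathlib import Path

# Hypothetical logger name; phold's real logging setup may differ.
logger = logging.getLogger("foldseek_guard")


def count_hits(result_tsv: Path) -> int:
    """Count non-blank rows in a tab-separated result file (one row per hit)."""
    if not result_tsv.exists():
        # Treat a missing result file the same as zero hits.
        return 0
    with result_tsv.open() as handle:
        return sum(1 for line in handle if line.strip())


def exit_if_no_hits(result_tsv: Path) -> int:
    """Warn and exit with status 1 if there are no hits; otherwise return the count."""
    n_hits = count_hits(result_tsv)
    if n_hits == 0:
        logger.warning(
            "Foldseek found 0 hits in %s; check the input gene calls.", result_tsv
        )
        sys.exit(1)
    return n_hits
```

Calling `exit_if_no_hits(Path("results.tsv"))` before any downstream parsing would stop the run with a readable warning instead of a traceback; `results.tsv` here is a placeholder path.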
Thanks for looking into this @gbouras13 ! A weird phage indeed...
Thanks for making and maintaining pharokka and phold!
Description
I'm running pharokka -> phold on a set of 853 complete (single-contig) phage genomes. phold gives a per_cds_predictions.tsv file for all phage except one: Pseudomonas phage PIP. After rerunning a few times, this does not appear to be a memory issue. Perhaps an interesting edge case?
What I Did
Please find the original fasta file, the pharokka gbk file, my conda yml files, and all the relevant log files at this Dropbox link:
https://www.dropbox.com/scl/fo/29tp5my2me718pr6wwb5c/AH8ggBgegenkhEhSvlZGdSs?rlkey=ivmz8w6zkdr7i3v89013ocwth&st=fcppkjqi&dl=0
Error traceback