Open samkleeman1 opened 2 years ago
Hi @samkleeman1, I think diamond ran out of memory with this genome. But I also think there was a bug that was fixed in the master that will help you, https://github.com/nextgenusfs/funannotate/commit/e74f867c9eec614f4b15504562789eb5fc51da21 is a fix for diamond version parsing and running diamond a slightly different way that uses less memory which is helpful with larger genomes.
You could upgrade your current installation with pip, ie
python -m pip install git+https://github.com/nextgenusfs/funannotate.git --upgrade --force --no-deps
And then can re-issue the same command and see if that fixes the problem.
You should just need the latest docker image, so run docker pull nextgenusfs/funannotate and then try command again if that doesn’t run the frame shift mode docker than it might not be fixed. The docker image is automatically updated after every commit using GitHub actions, so as long as you update your local image you will have all changes.
On Dec 23, 2021, at 12:04 PM, Rolando Perez @.***> wrote:
Hello @nextgenusfs, I just got a similar error while using funannotate-docker. Would upgrading via above command work for my fun annotate-docker run?
Thanks and happy holidays!
/rp
[Dec 23 11:55 AM]: OS: Debian GNU/Linux 10, 8 cores, ~ 2 GB RAM. Python: 3.8.12 [Dec 23 11:55 AM]: Running funannotate v1.8.10 [Dec 23 11:55 AM]: GeneMark not found and $GENEMARK_PATH environmental variable missing. Will skip GeneMark ab-initio prediction. [Dec 23 11:55 AM]: Skipping CodingQuarry as no --rna_bam passed [Dec 23 11:55 AM]: Parsed training data, run ab-initio gene predictors as follows: Program Training-Method augustus busco glimmerhmm busco snap busco [Dec 23 11:55 AM]: Loading genome assembly and parsing soft-masked repetitive sequences [Dec 23 11:58 AM]: Genome loaded: 11,706 scaffolds; 42,661,187 bp; 6.16% repeats masked [Dec 23 11:58 AM]: Mapping 553,202 proteins to genome using diamond and exonerate [Dec 23 11:59 AM]: CMD ERROR: diamond blastx --threads 8 -q /Users/rcp/fun2/predict_misc/genome.softmasked.fa --db diamond -o diamond.matches.tab -e 1e-10 -k 0 --more-sensitive --unal 0 -c 1 -F 15 -f 6 sseqid slen sstart send qseqid qlen qstart qend pident length evalue score qcovhsp qframe b'diamond v2.0.13.151 (C) Max Planck Society for the Advancement of Science\nDocumentation, support and updates available at http://www.diamondsearch.org\nPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)\n\n#CPU threads: 8\nScoring parameters: (Matrix=BLOSUM62 Lambda=0.267 K=0.041 Penalties=11/1)\nTemporary directory: \n#Target sequences to report alignments for: unlimited\nOpening the database... [0.062s]\nDatabase: diamond.dmnd (type: Diamond database, sequences: 553202, letters: 203792478)\nBlock size = 2000000000\nOpening the input file... [0.205s]\nOpening the output file... [0s]\nLoading query sequences... [0.379s]\nMasking queries... [0.595s]\nAlgorithm: Double-indexed\nBuilding query histograms... [1.663s]\nAllocating buffers... [0s]\nLoading reference sequences... [0.353s]\nMasking reference... [1.375s]\nInitializing temporary storage... [0s]\nBuilding reference histograms... [4.023s]\nAllocating buffers... [0s]\nProcessing query block 1, reference block 1/1, shape 1/16.\nBuilding reference seed array... [3.119s]\nBuilding query seed array... ' Traceback (most recent call last): File "/venv/bin/funannotate", line 8, in sys.exit(main()) File "/venv/lib/python3.8/site-packages/funannotate/funannotate.py", line 710, in main mod.main(arguments) File "/venv/lib/python3.8/site-packages/funannotate/predict.py", line 1051, in main lib.exonerate2hints(Exonerate, hintsP) File "/venv/lib/python3.8/site-packages/funannotate/library.py", line 4026, in exonerate2hints with open(file, 'r') as input: FileNotFoundError: [Errno 2] No such file or directory: '/Users/rcp/fun2/predict_misc/protein_alignments.gff3'
— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.
So far it seems my issue was the memory for the docker VM was set at 2 GB. Thank you!
Hi,
Many thanks for making this package available! I had an error as part of the funannotate predict step - it appears to be one that is mentioned in a previous issue #517 - however, I have the latest version of Diamond and I am still seeing the error.
Are you able to advise how to proceed.
With best wishes,
Sam Kleeman PhD Student Cold Spring Harbor Laboratory, NY