sagnikbanerjee15 / Finder

A fully automated gene annotator from RNA-Seq expression data
MIT License
53 stars 14 forks source link

Error: EXITING: Did not find the genome in memory, did not remove any genomes from shared memory #17

Open lenshinaagbor opened 3 years ago

lenshinaagbor commented 3 years ago

Hi, I just came across this software as I have been struggling with MAKER. Running the example data and my data, I have encountered this same isssue. It says it can't find the genome but it's right there.

EXITING: Did not find the genome in memory, did not remove any genomes from shared memory This is what is on the log.out file;

cat Log.out STAR version=2.7.7a STAR compilation time,server,dir=Mon Dec 28 13:38:40 EST 2020 vega:/home/dobin/data/STAR/STARcode/STAR.master/source

Command Line: STAR --runThreadN 30 --genomeLoad Remove --genomeDir /mnt/nfs/home/b9017460/finder/example/star_index_without_transcriptome

Initial USER parameters from Command Line: All USER parameters from Command Line: runThreadN 30 ~RE-DEFINED genomeLoad Remove ~RE-DEFINED genomeDir /mnt/nfs/home/b9017460/finder/example/star_index_without_transcriptome ~RE-DEFINED

Finished reading parameters from all sources Final user re-defined parameters-----------------: runThreadN 30 genomeDir /mnt/nfs/home/b9017460/finder/example/star_index_without_transcriptome genomeLoad Remove

Final effective command line: STAR --runThreadN 30 --genomeDir /mnt/nfs/home/b9017460/finder/example/star_index_without_transcriptome --genomeLoad Remove Number of fastq files for each mate = 1 Finished loading and checking parameters Reading genome generation parameters:

STAR --runMode genomeGenerate --runThreadN 30 --genomeDir star_index_without_transcriptome --genomeFastaFiles Arabidopsis_thaliana.TAIR10.dna_sm.toplevel.fa --genomeSAindexNbases 12 GstrandBit=32 versionGenome 2.7.4a ~RE-DEFINED genomeType Full ~RE-DEFINED genomeFastaFiles Arabidopsis_thaliana.TAIR10.dna_sm.toplevel.fa ~RE-DEFINED genomeSAindexNbases 12 ~RE-DEFINED genomeChrBinNbits 18 ~RE-DEFINED genomeSAsparseD 1 ~RE-DEFINED genomeTransformType None ~RE-DEFINED genomeTransformVCF - ~RE-DEFINED sjdbOverhang 0 ~RE-DEFINED sjdbFileChrStartEnd - ~RE-DEFINED sjdbGTFfile - ~RE-DEFINED sjdbGTFchrPrefix - ~RE-DEFINED sjdbGTFfeatureExon exon ~RE-DEFINED sjdbGTFtagExonParentTranscripttranscript_id ~RE-DEFINED sjdbGTFtagExonParentGene gene_id ~RE-DEFINED sjdbInsertSave Basic ~RE-DEFINED genomeFileSizes 120586240 985722733 ~RE-DEFINED Genome version is compatible with current STAR Number of real (reference) chromosomes= 7 1 1 30427671 0 2 2 19698289 30670848 3 3 23459830 50593792 4 4 18585056 74186752 5 5 26975502 92798976 6 Mt 366924 119799808 7 Pt 154478 120324096 Started loading the genome: Fri Jun 18 10:22:22 2021

Genome: size given as a parameter = 120586240 SA: size given as a parameter = 985722733 SAindex: size given as a parameter = 1 Read from SAindex: pGe.gSAindexNbases=12 nSAi=22369620 nGenome=120586240; nSAbyte=985722733 GstrandBit=32 SA number of indices=238963086

EXITING: Did not find the genome in memory, did not remove any genomes from shared memory

Jun 18 10:22:22 ...... FATAL ERROR, exiting

Please help. I have tried adding the location but still not working :( I have been trying to annotate a genome for a very long time now but keep encountering errors.

Best, Lenshina

sagnikbanerjee15 commented 3 years ago

Hi Lenshina,

Thank you for your interest in finder. The "error" that you see is not actually an error. Before finder kicks off the SRA data download, it will attempt to remove any lingering STAR images on the RAM. When the error says "EXITING: Did not find the genome in memory, did not remove any genomes from shared memory" it means that it cannot find the image in the RAM. Please ignore the message and let finder run. Let me know if you encounter any further issues.

Thank you.

lenshinaagbor commented 3 years ago

Hi Sagnik,

Thanks for your response. Indeed, Finder ran with the example data and my data but I still ended up with an error message and empty folders. I am using our cluster and cannot say what I'm not doing right and it is very frustrating. In brief, I am trying to annotate the genome of Trichomonas tenax (link to genome: https://sra-download.ncbi.nlm.nih.gov/traces/wgs01/wgs_aux/OC/TD/OCTD01/OCTD01.1.fsa_nt.gz) (RNAseq data; PRJNA340083, SRR4063857, 1 ILLUMINA (Illumina HiSeq 1500) run)) (link to annotated protein sequences of related species;https://trichdb.org/common/downloads/Current_Release/TvaginalisG3/fasta/data/TrichDB-52_TvaginalisG3_AnnotatedProteins.fasta).

The error files I am getting are as follows; braker error; ERROR in file /mnt/storage/nobackup/b9017460/finder/dep/BRAKER/scripts/braker.pl at line 6645 Failed to execute: perl /mnt/storage/nobackup/b9017460/finder/dep/gmes_linux_64/gmes_petap.pl --verbose --sequence=/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/genome.fa --ET=/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/genemark_hintsfile.gff --cores=30 --gc_donor 0.001 --soft_mask auto 1>/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/GeneMark-ET.stdout 2>/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/errors/GeneMark-ET.stderr Failed to execute: perl /mnt/storage/nobackup/b9017460/finder/dep/gmes_linux_64/gmes_petap.pl --verbose --sequence=/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/genome.fa --ET=/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/genemark_hintsfile.gff --cores=30 --gc_donor 0.001 --soft_mask auto 1>/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/GeneMark-ET.stdout 2>/mnt/storage/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/errors/GeneMark-ET.stderr The most common problem is an expired or not present file ~/.gmkey!

The error file;

EXITING: Did not find the genome in memory, did not remove any genomes from shared memory Jun 20 07:08:08 ...... FATAL ERROR, exiting EXITING: Did not find the genome in memory, did not remove any genomes from shared memory Jun 20 09:29:58 ...... FATAL ERROR, exiting EXITING: Did not find the genome in memory, did not remove any genomes from shared memory Jun 20 09:30:45 ...... FATAL ERROR, exiting /mnt/nfs/home/b9017460/finder/scripts/fixOverlappingAndMergedTranscripts.py:347: VisibleDeprecationWarning: Creating an ndarray from ragged nested sequences (which is a list-or-tuple of lists-or-tuples-or ndarrays with different lengths or shapes) is deprecated. If you meant to do this, you must specify 'dtype=object' when creating the ndarray coverage_info[transcript_id]["bed_cov"]=np.array(temp) rm: cannot remove ‘/nobackup/b9017460/finder/Tenax/FINDER_test_tenax/assemblies_psiclassmodified/combined/outputfileforCPD*’: No such file or directory Command line was: gffread -E /nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/braker.gff3 -T -o /nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/braker.gtf Error: cannot open input file /nobackup/b9017460/finder/Tenax/FINDER_test_tenax/braker/braker.gff3! Traceback (most recent call last): File "/mnt/nfs/home/b9017460/finder/finder", line 668, in main() File "/mnt/nfs/home/b9017460/finder/finder", line 649, in main removeSpuriousTranscriptsBasedOnCDS(options,logger_proxy,logging_mutex) File "/mnt/nfs/home/b9017460/finder/scripts/removeSpuriousTranscriptsBasedOnCDS.py", line 53, in removeSpuriousTranscriptsBasedOnCDS all_transcripts_info_gene_to_transcripts[gene_id][transcript_id]["confidence"]="repeat_region" KeyError: 'OCTD01000001.1.1'

Please, can you help?

Thanks, Lenshina

sagnikbanerjee15 commented 3 years ago

Hello Lenshina,

I completely understand how frustrating it can be to encounter issues like this. There seems to be an issue with braker. Sometimes braker fails on small genomes and fragmented chromosomes. Thank you for providing the data you used for annotation. I will rerun it on my end and see if I can replicate the error. I might have to include an option to skip the braker run for certain genomes. Please hang tight we will get through this!

Thank you.

lenshinaagbor commented 3 years ago

Thank you very much :)