Closed Musketeer-D closed 3 weeks ago
Hi, sorry about that, looks like it is out of memory - do you know how large the non redundant transcriptome file is? I will try to recreate the error. Is 32GB the maximum you have available, if not could you try with more?
I am using the human genome GRCh38. I think the non-redundant transcriptome file is about 429M, as you can see: ll -ht /Users/pricedavid/dataofdavid/code/genome/Homo_sapiens.GRCh38.cdna.all.fa -rw-rw-rw-@ 1 pricedavid staff 429M Dec 7 12:54 /Users/pricedavid/dataofdavid/code/genome/Homo_sapiens.GRCh38.cdna.all.fa
I cannot run wf-transcriptomes with or without the mem.config parameter (I added the mem.config option as you advised in another issue here).
I ran wf-transcriptomes on a DELL server, which has 256GB RAM and 10TB ROM (no other job was running at the time I ran wf-transcriptomes), so I am quite confused as to why wf-transcriptomes could run out of memory.
Thank you for your kind help !
Hi, the file you shared isn't the non redundant transcriptomes, are you able to navigate to the work directory /mnt/32ac0a57-4519-4a01-8234-37c7fb4537e7/sgbio/deg/work/62/ef8862158c3644155267281a76e7a5
and check the file size. We have some memory improvements for minimap2 that we will apply in a future release.
If you are only interested in looking at differential expression of the transcriptome Homo_sapiens.GRCh38.cdna.all.fa
you could select the --transcriptome_source precomputed
parameter, which will bypass creating the new transcriptome (which can often be large).
Closing through lack of response
Operating System
Ubuntu 22.04
Other Linux
No response
Workflow Version
v1.1.1
Workflow Execution
Command line
EPI2ME Version
No response
CLI command run
/root/software/nextflow run epi2me-labs/wf-transcriptomes \ --fastq /mnt/32ac0a57-4519-4a01-8234-37c7fb4537e7/sgbio/batch1/fastq_pass \ --de_analysis --ref_genome /mnt/32ac0a57-4519-4a01-8234-37c7fb4537e7/sgbio/Homo_sapiens.GRCh38.dna.primary_assembly.fa \ --transcriptome-source reference-guided \ --ref_annotation /mnt/32ac0a57-4519-4a01-8234-37c7fb4537e7/sgbio/Homo_sapiens.GRCh38.110.chr.gff3 \ --minimap2_index_opts '-k 15' --sample_sheet ./sample_sheet-1.csv \ --threads 1 -c mem.config --cdna_kit SQK-PCS109 --out_dir /mnt/32ac0a57-4519-4a01-8234-37c7fb4537e7/sgbio/deg/test date
mem.config process { withName: build_minimap_index_transcriptome { memory = 32.GB } }
Workflow Execution - CLI Execution Profile
None
What happened?
Command error: .command.sh: line 2: 31 Killed minimap2 -t "1" -k 15 -I 1000G -d "genome_index.mmi" "final_non_redundant_transcriptome.fasta"
Relevant log output
Application activity log entry
No response
Were you able to successfully run the latest version of the workflow with the demo data?
yes
Other demo data information
No response