Indexing a 2.6GB Plant Genome Using STAR (genomeGenerate Mode) Terminated Due to RAM Limit — Can the Genome Be Split for Indexing, or Are There Other Solutions? #2235
I am trying to index my plant genome that was de novo assembled, using the STAR aligner tool. The assembly file contains 2,976,459 contigs with N50 being 1,293kb.
EXITING because of FATAL PARAMETER ERROR: limitGenomeGenerateRAM=31000000000is too small for your genome
SOLUTION: please specify --limitGenomeGenerateRAM not less than 2080695648522 and make that much RAM available
System capabilities: CPU with 8 cores and 244GB RAM.
One of the suggestions I was given was that the contig counts be the potential culprit. I haven't checked for any duplicates in my assembly file. Since I don't have assembly data of any related species under the same genus, I doubt if scaffolding using tools like RagTag would be helpful. Therefore I am looking forward to suggestions as to how to perform indexing within my system capacity.
Note: my end-goal is to perform BRAKER (with RNA seq and protein data), and for the Stringtie2 to work, the aligned reads need to have XS tags. With STAR aligner, this is possible.
I am trying to index my plant genome that was de novo assembled, using the STAR aligner tool. The assembly file contains 2,976,459 contigs with N50 being 1,293kb.
The following command was used:
And the error that was encountered was
System capabilities: CPU with 8 cores and 244GB RAM.
One of the suggestions I was given was that the contig counts be the potential culprit. I haven't checked for any duplicates in my assembly file. Since I don't have assembly data of any related species under the same genus, I doubt if scaffolding using tools like RagTag would be helpful. Therefore I am looking forward to suggestions as to how to perform indexing within my system capacity.
Note: my end-goal is to perform BRAKER (with RNA seq and protein data), and for the Stringtie2 to work, the aligned reads need to have XS tags. With STAR aligner, this is possible.