alexdobin / STAR

RNA-seq aligner
MIT License
1.85k stars 506 forks source link

genomeGenerate Aborted with std::bad_malloc #28

Closed rronen closed 9 years ago

rronen commented 9 years ago

Hi Alex,

I am preparing STAR for use and getting this failure on genomeGenerate. I've tried twice, the second time adding --limitGenomeGenerateRAM 30000000000 but this doesn't seem to help. It failed both times having written ~47Gb in the genomeDir and seemed to be using ~19Gb or RAM most of the time (though I'm not sure exactly how much it used prior to the crash).

ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$
ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ STAR --genomeDir /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star --genomeFastaFiles /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa --runMode genomeGenerate --sjdbOverhang 99 --sjdbGTFfile /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf --genomeSAindexNbases 14 --runThreadN 16 --limitGenomeGenerateRAM 30000000000
Mar 20 16:39:32 ..... Started STAR run
Mar 20 16:39:32 ... Starting to generate Genome files
Mar 20 16:40:58 ... finished processing splice junctions database ...
Mar 20 16:41:17 ... starting to sort  Suffix Array. This may take a long time...
Mar 20 16:41:37 ... sorting Suffix Array chunks and saving them to disk...
Mar 20 16:54:25 ... loading chunks from disk, packing SA...
terminate called after throwing an instance of 'std::bad_alloc'
  what():  std::bad_alloc
Aborted (core dumped)
ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ echo `STAR --version`
STAR_2.4.0e
ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ 

Any help resolving this would be much appreciated!

Thanks, Roy


Some extracts from Log:

STAR version=STAR_2.4.0e
STAR compilation time,server,dir=Fri Oct 24 10:43:53 EDT 2014 verona.cshl.edu:/sonas-hs/gingeras/nlsas_norepl/user/dobin/STAR/STAR.sandbox/source
##### Command Line:
STAR --genomeDir /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star --genomeFastaFiles /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa --runMode genomeGenerate --sjdbOverhang 99 --sjdbGTFfile /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf --genomeSAindexNbases 14 --runThreadN 16 --limitGenomeGenerateRAM 30000000000
##### Initial USER parameters from Command Line:
###### All USER parameters from Command Line:
genomeDir                     /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star     ~RE-DEFINED
genomeFastaFiles              /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa        ~RE-DEFINED
runMode                       genomeGenerate     ~RE-DEFINED
sjdbOverhang                  99     ~RE-DEFINED
sjdbGTFfile                   /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf     ~RE-DEFINED
genomeSAindexNbases           14     ~RE-DEFINED
runThreadN                    16     ~RE-DEFINED
limitGenomeGenerateRAM        30000000000     ~RE-DEFINED
##### Finished reading parameters from all sources

##### Final user re-defined parameters-----------------:
runMode                           genomeGenerate
runThreadN                        16
genomeDir                         /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star
genomeFastaFiles                  /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa   
genomeSAindexNbases               14
limitGenomeGenerateRAM            30000000000
sjdbGTFfile                       /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf
sjdbOverhang                      99

-------------------------------
##### Final effective command line:
STAR   --runMode genomeGenerate   --runThreadN 16   --genomeDir /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star   --genomeFastaFiles /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa      --genomeSAindexNbases 14   --limitGenomeGenerateRAM 30000000000   --sjdbGTFfile /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf   --sjdbOverhang 99
Finished loading and checking parameters
Mar 20 16:39:32 ... Starting to generate Genome files
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 0  "1" chrStart: 0
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 1  "2" chrStart: 249298944
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 2  "3" chrStart: 492568576
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 3  "4" chrStart: 690749440
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 4  "5" chrStart: 882114560
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 5  "6" chrStart: 1063256064
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 6  "7" chrStart: 1234436096
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 7  "8" chrStart: 1393819648
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 8  "9" chrStart: 1540358144
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 9  "10" chrStart: 1681653760
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 10  "11" chrStart: 1817444352
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 11  "12" chrStart: 1952710656
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 12  "13" chrStart: 2086666240
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 13  "14" chrStart: 2202009600
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 14  "15" chrStart: 2309488640
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 15  "16" chrStart: 2412249088
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 16  "17" chrStart: 2502688768
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 17  "18" chrStart: 2583953408
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 18  "19" chrStart: 2662072320
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 19  "20" chrStart: 2721316864
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 20  "21" chrStart: 2784493568
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 21  "22" chrStart: 2832728064
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 22  "X" chrStart: 2884108288
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 23  "Y" chrStart: 3039559680
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 24  "MT" chrStart: 3099066368
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 25  "GL000207.1" chrStart: 3099328512
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 26  "GL000226.1" chrStart: 3099590656
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 27  "GL000229.1" chrStart: 3099852800
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 28  "GL000231.1" chrStart: 3100114944
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 29  "GL000210.1" chrStart: 3100377088
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 30  "GL000239.1" chrStart: 3100639232
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 31  "GL000235.1" chrStart: 3100901376
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 32  "GL000201.1" chrStart: 3101163520
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 33  "GL000247.1" chrStart: 3101425664
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 34  "GL000245.1" chrStart: 3101687808
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 35  "GL000197.1" chrStart: 3101949952
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 36  "GL000203.1" chrStart: 3102212096
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 37  "GL000246.1" chrStart: 3102474240
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 38  "GL000249.1" chrStart: 3102736384
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 39  "GL000196.1" chrStart: 3102998528
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 40  "GL000248.1" chrStart: 3103260672
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 41  "GL000244.1" chrStart: 3103522816
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 42  "GL000238.1" chrStart: 3103784960
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 43  "GL000202.1" chrStart: 3104047104
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 44  "GL000234.1" chrStart: 3104309248
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 45  "GL000232.1" chrStart: 3104571392
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 46  "GL000206.1" chrStart: 3104833536
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 47  "GL000240.1" chrStart: 3105095680
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 48  "GL000236.1" chrStart: 3105357824
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 49  "GL000241.1" chrStart: 3105619968
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 50  "GL000243.1" chrStart: 3105882112
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 51  "GL000242.1" chrStart: 3106144256
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 52  "GL000230.1" chrStart: 3106406400
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 53  "GL000237.1" chrStart: 3106668544
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 54  "GL000233.1" chrStart: 3106930688
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 55  "GL000204.1" chrStart: 3107192832
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 56  "GL000198.1" chrStart: 3107454976
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 57  "GL000208.1" chrStart: 3107717120
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 58  "GL000191.1" chrStart: 3107979264
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 59  "GL000227.1" chrStart: 3108241408
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 60  "GL000228.1" chrStart: 3108503552
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 61  "GL000214.1" chrStart: 3108765696
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 62  "GL000221.1" chrStart: 3109027840
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 63  "GL000209.1" chrStart: 3109289984
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 64  "GL000218.1" chrStart: 3109552128
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 65  "GL000220.1" chrStart: 3109814272
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 66  "GL000213.1" chrStart: 3110076416
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 67  "GL000211.1" chrStart: 3110338560
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 68  "GL000199.1" chrStart: 3110600704
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 69  "GL000217.1" chrStart: 3110862848
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 70  "GL000216.1" chrStart: 3111124992
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 71  "GL000215.1" chrStart: 3111387136
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 72  "GL000205.1" chrStart: 3111649280
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 73  "GL000219.1" chrStart: 3111911424
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 74  "GL000224.1" chrStart: 3112173568
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 75  "GL000223.1" chrStart: 3112435712
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 76  "GL000195.1" chrStart: 3112697856
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 77  "GL000212.1" chrStart: 3112960000
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 78  "GL000222.1" chrStart: 3113222144
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 79  "GL000200.1" chrStart: 3113484288
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 80  "GL000193.1" chrStart: 3113746432
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 81  "GL000194.1" chrStart: 3114008576
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 82  "GL000225.1" chrStart: 3114270720
/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa : chr # 83  "GL000192.1" chrStart: 3114532864
Processing sjdbGTFfile=/home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf, found:
                196501 transcripts
                1195764 exons (non-collapsed)
                344569 collapsed junctions
Mar 20 16:40:58 ... finished processing splice junctions database ...
Writing genome to disk...Writing 3183874000 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/Genome ; empty space on disk = 166107160576 bytes ... done
 done.
Number of SA indices: 5865990696
SA size in bytes: 24197211622
Mar 20 16:41:17 ... starting to sort  Suffix Array. This may take a long time...
Number of chunks: 62;   chunks size limit: 886207608 bytes
Mar 20 16:41:37 ... sorting Suffix Array chunks and saving them to disk...
Writing 599357856 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_1 ; empty space on disk = 162923352064 bytes ... done
Writing 577153280 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_13 ; empty space on disk = 162322223104 bytes ...Writing 619007584 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_4 ; empty space on disk = 162035646464 bytes ... done
 done
Writing 634268672 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_14 ; empty space on disk = 161122529280 bytes ...Writing 716890648 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_0 ; empty space on disk = 160770732032 bytes ... done
 done
Writing 714483232 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_3 ; empty space on disk = 159772667904 bytes ... done
Writing 694834640 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_11 ; empty space on disk = 159056072704 bytes ... done
Writing 752341344 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_2 ; empty space on disk = 158359187456 bytes ...Writing 722691600 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_10 ; empty space on disk = 157626068992 bytes ... done
Writing 717814040 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_15 ; empty space on disk = 157309399040 bytes ...Writing 773328688 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_9 ; empty space on disk = 157141749760 bytes ...Writing 794598600 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_6 ; empty space on disk = 156506316800 bytes ...Writing 750742752 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_12 ; empty space on disk = 155990577152 bytes ... done
Writing 810151080 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_5 ; empty space on disk = 155494641664 bytes ...Writing 829488920 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_8 ; empty space on disk = 155056332800 bytes ...Writing 858391408 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_7 ; empty space on disk = 154779631616 bytes ... done
 done
 done
 done
 done
 done
 done
Writing 741898600 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_17 ; empty space on disk = 151357706240 bytes ... done
Writing 667114832 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_18 ; empty space on disk = 150613618688 bytes ... done
Writing 706991272 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_20 ; empty space on disk = 149946707968 bytes ... done
Writing 840425976 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_16 ; empty space on disk = 149237633024 bytes ... done
Writing 886122488 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_19 ; empty space on disk = 148401233920 bytes ... done
Writing 793602056 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_28 ; empty space on disk = 147515113472 bytes ... done
Writing 723365672 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_25 ; empty space on disk = 146719170560 bytes ...Writing 674447488 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_23 ; empty space on disk = 146449977344 bytes ... done
 done
Writing 711395888 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_29 ; empty space on disk = 145317228544 bytes ...Writing 700092856 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_27 ; empty space on disk = 144836620288 bytes ... done
Writing 751083424 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_21 ; empty space on disk = 144228253696 bytes ... done
 done
Writing 792413336 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_22 ; empty space on disk = 143148273664 bytes ... done
Writing 861232896 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_26 ; empty space on disk = 142359969792 bytes ... done
Writing 826416264 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_31 ; empty space on disk = 141496197120 bytes ...Writing 844898024 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_30 ; empty space on disk = 140956876800 bytes ...Writing 863024328 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_24 ; empty space on disk = 140389912576 bytes ... done
 done
 done
Writing 745171728 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_32 ; empty space on disk = 138973077504 bytes ... done
Writing 772097912 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_33 ; empty space on disk = 138225704960 bytes ... done
Writing 789894584 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_34 ; empty space on disk = 137455796224 bytes ... done
Writing 757973088 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_36 ; empty space on disk = 136665890816 bytes ... done
Writing 547743480 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_43 ; empty space on disk = 135907909632 bytes ... done
Writing 762137728 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_38 ; empty space on disk = 135358545920 bytes ... done
Writing 682276800 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_47 ; empty space on disk = 134594158592 bytes ...Writing 870757576 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_35 ; empty space on disk = 134004498432 bytes ... done
Writing 737698816 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_37 ; empty space on disk = 133276700672 bytes ...Writing 688834280 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_44 ; empty space on disk = 132982648832 bytes ... done
 done
 done
Writing 812803624 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_39 ; empty space on disk = 131607408640 bytes ... done
Writing 727867312 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_41 ; empty space on disk = 130794450944 bytes ... done
Writing 804211744 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_45 ; empty space on disk = 130066444288 bytes ...Writing 796289520 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_40 ; empty space on disk = 129299873792 bytes ... done
Writing 842436704 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_42 ; empty space on disk = 128590712832 bytes ... done
 done
Writing 872813384 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_46 ; empty space on disk = 127623057408 bytes ... done
Writing 674809448 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_48 ; empty space on disk = 126761959424 bytes ... done
Writing 711654952 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_49 ; empty space on disk = 126085156864 bytes ... done
Writing 823461280 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_50 ; empty space on disk = 125375483904 bytes ... done
Writing 558710384 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_57 ; empty space on disk = 124549591040 bytes ... done
Writing 718695328 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_61 ; empty space on disk = 123993296896 bytes ... done
Writing 841699200 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_54 ; empty space on disk = 123272478720 bytes ... done
Writing 857675400 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_52 ; empty space on disk = 122428297216 bytes ... done
Writing 686570256 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_60 ; empty space on disk = 121568079872 bytes ... done
Writing 655735720 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_56 ; empty space on disk = 120879484928 bytes ... done
Writing 869190152 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_53 ; empty space on disk = 120226410496 bytes ... done
Writing 855636920 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_55 ; empty space on disk = 119357186048 bytes ... done
Writing 785017520 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_58 ; empty space on disk = 118499024896 bytes ...Writing 879050080 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_51 ; empty space on disk = 117840498688 bytes ... done
Writing 848940904 bytes into /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star/SA_59 ; empty space on disk = 116834627584 bytes ... done
 done
Mar 20 16:54:25 ... loading chunks from disk, packing SA...
rronen commented 9 years ago

Update: also tried with --genomeSAindexNbases 4 as I saw on some forums/lists it might help resolve such issues. No luck.

rronen commented 9 years ago

Sorry, just noticed you may have fixed this on 2.4.0i.

2.4.0i 01/14/2015
Fixed a bug with the _STARtmp temporary directory name for the 2-pass runs.
Fixed a bug causing seg-faults for genome generation.
Fixed a bug causing seg-faults for --quantMode TranscriptomeSAM

I got the current (static) executable & running no -- will close this as soon as that works (hopefully).

rronen commented 9 years ago

Nope, getting the exact same error with latest executable:

ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ ./STAR --genomeDir /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../star --genomeFastaFiles /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/GRCh37.fa --runMode genomeGenerate --sjdbOverhang 99 --sjdbGTFfile /home/ubuntu/bcbio_datadir/genomes/Hsapiens/GRCh37/seq/../rnaseq/ref-transcripts.gtf --genomeSAindexNbases 14 --runThreadN 16 --limitGenomeGenerateRAM 30000000000
Mar 20 18:44:13 ..... Started STAR run
Mar 20 18:44:13 ... Starting to generate Genome files
Mar 20 18:44:24 ... Starting GTF processing
Mar 20 18:44:42 ... Finished GTF processing
Mar 20 18:45:18 ... finished processing splice junctions database ...
Mar 20 18:45:36 ... starting to sort  Suffix Array. This may take a long time...
Mar 20 18:45:55 ... sorting Suffix Array chunks and saving them to disk...
Mar 20 18:58:56 ... loading chunks from disk, packing SA...
terminate called after throwing an instance of 'std::bad_alloc'
  what():  std::bad_alloc
Aborted (core dumped)
ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ du -h .
4.0K    ./fail_11am/_STARtmp
47G     ./fail_11am
4.0K    ./_STARtmp
94G     .
ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ 
ubuntu@master:~/bcbio_datadir/genomes/Hsapiens/GRCh37/star$ ./STAR --version
STAR_2.4.0j_modified

Any suggestions would be most welcome.

Thanks, Roy

alexdobin commented 9 years ago

Hi Roy,

how much RAM do you have available? For this genome you need at least 30GB.

Cheers Alex

rronen commented 9 years ago

I think I have exactly that. Do you expect this is causing the issues?

ubuntu@master:/bcbio_work/ubuntu/encrypted/STAR_index$ cat /proc/meminfo
MemTotal:       30610040 kB

* this is an AWS/EC2 instance type c3.4xlarge

alexdobin commented 9 years ago

I think this may not be enough, since it's 29GB. It will probably be enough for mapping, but genome generation requires 1-2GB more RAM than mapping. Do you have a server with slightly more RAM?

rronen commented 9 years ago

I see, yes I can manage something larger. Will give it a spin with more RAM shortly. Thanks!

Update: this was indeed the issue. It finished without problems on a larger server.