bcgsc / RNA-Bloom

:hibiscus: reference-free transcriptome assembly for short and long reads

RNABloom not creating transcriptome files #62

Open dvirdi01 opened 8 months ago

dvirdi01 commented 8 months ago

I ran rnabloom on several of my input files individually. Out of my 10 samples, it did not produce a transcriptome file for 2 of them. Why is that? The run finished, but there are no output files. I ran the following command:

rnabloom -long sample.fastq -t 48 -outdir .../.../sample -k 10 -e 5

These are my versions:

kmnip commented 8 months ago

Please remove the options -k 10 -e 5 and try again. If the issue persists, please report the entire log file.

dvirdi01 commented 8 months ago

Hi, I have run my files both with -k 10 -e 5 and with the default settings in the past. The default settings produce the transcriptome file, but -k 10 -e 5 does not. This is the output I get for the -k 10 -e 5 run:

Parsing sample.fastq...
Parsed 21,301,762 sequences in 2h 7m 13s
DBG Bloom filter FPR: 1.04 %
Counting Bloom filter FPR: 1.04 %
Stage 1 completed in 2h 7m 13s

Stage 2: Correct long reads for "rnabloom"
Parsing sample.fastq...
Corrected Read Lengths Sampling Distribution (n=10000)
min   q1    med   q3    max
10    193   364   715   2643
Parsed 21,301,762 sequences.
Kept: 21,301,202 (100.0 %)
Discarded: 560 (0.00263 %)
Artifacts: 4,220,516 (19.81299%)
Corrected reads in 1h 43m 33s
Extracting seed sequences...
strobemers: n=3, k=11, wMin=12, wMax=61, depth=3
Bloom filter FPR: 91.0 %
before: 20,782,469
after: 15,730 (0.0757 %)
too short: 0
Extraction completed in 43m 1s
Stage 2 completed in 2h 26m 35s

Stage 3: Assemble long reads for "rnabloom"
Overlapping sequences...
Parsed 231,695 overlap records in 15.355s
total reads: 15,730

This is the log file:

[racon::Polisher::initialize] loaded target sequences 0.043017 s
[racon::Polisher::initialize] loaded sequences 174.185133 s
Killed

I also tried giving it multiple files as input so it does a combined assembly with default settings. However, it gives me this error:

Exception: java.lang.OutOfMemoryError thrown from the UncaughtExceptionHandler in thread "Thread-836"
Exception in thread "Thread-825" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-830" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-832" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-854" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-838" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-828" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-822" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-844" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-823" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-816" java.lang.OutOfMemoryError: Java heap space

Exception: java.lang.OutOfMemoryError thrown from the UncaughtExceptionHandler in thread "Thread-853"
Exception in thread "Thread-839" Exception in thread "Thread-850" java.lang.OutOfMemoryError: Java heap space
java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-860" Exception in thread "Thread-852" java.lang.OutOfMemoryError: Java heap space
java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-841" java.lang.OutOfMemoryError: Java heap space

Exception: java.lang.OutOfMemoryError thrown from the UncaughtExceptionHandler in thread "Thread-827"
Exception in thread "Thread-846" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-821" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-831" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-835" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-849" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-817" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-859" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-824" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-843" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-848" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-863" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-818" java.lang.OutOfMemoryError: Java heap space
Exception in thread "Thread-851" java.lang.OutOfMemoryError: Java heap space: failed reallocation of scalar replaced objects
Exception in thread "Thread-847" java.lang.OutOfMemoryError: Java heap space
ERROR: Cannot throw exception because the return value of "rnabloom.RNABloom$LongReadCorrectionWorker.getExceptionCaught()" is null
java.lang.NullPointerException: Cannot throw exception because the return value of "rnabloom.RNABloom$LongReadCorrectionWorker.getExceptionCaught()" is null
    at rnabloom.RNABloom.correctLongReadsMultithreaded(RNABloom.java:4006)
    at rnabloom.RNABloom.correctLongReads(RNABloom.java:5266)
    at rnabloom.RNABloom.main(RNABloom.java:7346)

kmnip commented 8 months ago

For the first run, the Racon command was killed for some unknown reason. I don't think it has to do with memory or the -k or -e options.
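One way to rule memory in or out for a bare "Killed" message: on Linux, the kernel's out-of-memory killer records the processes it terminates in the kernel log. The sketch below assumes a Linux host where you can read that log; oom_hits is a hypothetical helper name, and the search patterns are common kernel message fragments rather than an exhaustive list.

```shell
# Hypothetical helper: print kernel-log lines that suggest the OOM killer ran.
# Reads from the files given as arguments, or from stdin when none are given.
oom_hits() {
    grep -Ei 'out of memory|oom-kill|killed process' "$@"
}

# Typical use (may need elevated privileges on some systems):
#   dmesg | oom_hits
#   journalctl -k | oom_hits
```

If a line naming racon shows up around the time of the run, memory was the cause; if nothing matches, the process was killed by something else, such as a scheduler limit or a manual signal.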

The second run with multiple input files ran out of Java heap space. As mentioned previously, you can increase the max Java heap space if you have enough RAM.
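A minimal sketch of raising the heap limit, assuming rnabloom is started through a wrapper script that launches a standard JVM. JAVA_TOOL_OPTIONS is an environment variable the JVM reads at startup, and -Xmx sets the maximum heap size. The 64 GB value is an arbitrary placeholder; choose something below the physical RAM of the machine.

```shell
# Placeholder heap size in GB (an assumption, not a recommendation).
HEAP_GB=64

# The JVM picks this up at startup and applies -Xmx as if passed on the command line.
export JAVA_TOOL_OPTIONS="-Xmx${HEAP_GB}g"
echo "JVM options: ${JAVA_TOOL_OPTIONS}"

# Then rerun the assembly in the same shell, for example:
#   rnabloom -long sample.fastq -t 48 -outdir .../.../sample
# If you invoke the jar directly instead, the flag goes on the java command
# (jar file name assumed here):
#   java -Xmx64g -jar RNA-Bloom.jar -long sample.fastq -t 48 -outdir .../.../sample
```

The JVM prints a "Picked up JAVA_TOOL_OPTIONS" notice when the variable is set, which is a quick way to confirm the setting took effect.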