Closed TingHsuanChen closed 5 years ago
What coverage do you have? High coverage samples will require more memory when assembling. You
can give GRIDSS more memory by changing the -Xmx16g
to something higher (I typically use -Xmx31g
). Note that values between 32Gb and ~48Gb should not be used as they actually result in less usable memory due to java compressed oops.
I'm trying to analyze our RNAseq data with GRIDSS.
Note that at this time I do no recommend using GRIDSS on RNA-Seq data as only a single assembly is called from each breakend branch.
Eg: Transcript A: exon 1, 2, 3 Transcript B: exon 1, 2, 4
When assembly is performed at the exon 1 boundary, the assembly contig will contain exon 2 then exon 3, or exon 2 then exon 4, but not both. Only the most highly supported assembly graph branch will be called. It is even more problematic if exons 3 and 4 start with the same base as when assembling from exon 2, only one of the two branches will be taken (since the first kmer is shared so they are treated as the same branch).
This will be fixed in a future revision of the GRIDSS assembler but for the moment I cannot in good conscience recommend it for RNA-Seq junction calling as some well-supported junctions will be missing assembly support.
Hi,
I'm trying to analyze our RNAseq data with GRIDSS. I started from a trial on single bam file with a small proportion of the reference genome (chromosome 1). The settings are as follows:
NORMAL=../hisat2_alignment_ch1/01_k100Mm_ch1.bam
REFERENCE=../bwa/Vitis_genome_ch1.fasta
OUTPUT=Ctrl_a_ch1.sv.vcf
ASSEMBLY=${OUTPUT/.sv.vcf/.gridss.assembly.bam}
GRIDSS_JAR=/home/ting-hsuan/gridss.jar
java -ea -Xmx16g \
-Dsamjdk.create_index=true \
-Dsamjdk.use_async_io_read_samtools=true \
-Dsamjdk.use_async_io_write_samtools=true \
-Dsamjdk.use_async_io_write_tribble=true \
-Dsamjdk.compression_level=1 \
-Dgridss.gridss.output_to_temp_file=true \
-Dgridss.defensiveGC=true \
-cp $GRIDSS_JAR gridss.CallVariants \
TMP_DIR=. \
WORKING_DIR=. \
CONFIGURATION_FILE=gridss.properties \
REFERENCE_SEQUENCE="$REFERENCE" \
INPUT="$NORMAL" \
OUTPUT="$OUTPUT" \
ASSEMBLY="$ASSEMBLY" \
WORKER_THREADS=1
It ran smoothly at the beginning, then came with this error message:
Should I add
-XX:-UseGCOverheadLimit
to the java code?Kind regards, Ting-Hsuan