Closed DaGaMs closed 4 years ago
Any way I can access this data to reproduce?
Also, do you really need to be calling all contigs in hg38? Probably a large amount of time is spent processing alt, unplaced, and the decoy contig that most people don't use. It's also where the error occurs. I'd highly recommend restricting calling to the standard chromosomes unless you have plans for the other contigs.
Also, you don't need to use bcftools to get bgzipped output; Octopus can do this automatically - it recognises the provided file extension to --output
. So you can just use:
octopus -X 2G -R Data/genomes/hg38_full_gatk_HPV_HBV_HCV_EBV.fa -I tumour.bam control.bam -N control --threads 4 --somatics-only --somatic-forest-file octopus/resources/forests/somatic.v0.6.3-beta.forest -o calls.vcf.gz
Well, I just ran it on everything because... Why not? Who knows, it might be interesting at some point 😉
Anyway, the data are on our private cluster. What file(s) would you want to have access to? I can move some of it to the BDI cluster.
Also, is there a way to rescue the process, or will I have to start from 0? I have some 364MB of temp.bcf
files in octopus-temp
...
Well, I just ran it on everything because... Why not?
Well it could be a third of your runtime is spent in these contigs so that's one reason... I'd at-least skip chrUn_JTFH01000785v1_decoy
as all you'll see nothing useful there.
What file(s) would you want to have access to?
I'll just need the BAM files.
Also, is there a way to rescue the process, or will I have to start from 0?
You can merge all these bcfs to get an unfiltered vcf, then you just need to filter this; you can use the --filter-vcf
option.
Closing due to inactivity. Please re-open if needed.
Describe the bug I ran Octopus (0.6.3-beta) on some whole-genome sequencing cancer data from TCGA as a test. I re-aligned the data to the GATK-provided hg38 full reference, including all
alt
alleles and unassembled contigs. The first step finished ok:but then the "Call Set Refinement" dies:
Version
Command Command line to run octopus: