Open yuanhb1993 opened 5 days ago
Hi Yuan,
Sorry to hear about the difficulties in running MitoHPC.
Your run.all.sh script seems fine. You only have one sample (hanwei_bqsr) that you are trying to analyze, correct ?
Just make sure that the bam was generated by aligning the reads in paired-end mode using one of the standard aligners (bwa,bowtie2,minimap2...), sorted and indexed .
Then run "bash ./run.all.sh"
If you could send me the output of the "ls -l -t1 out/ " command, I might get a better idea what the issue is.
Hi, I am currently learning to use MitoHPC to analyze WGS data. After successfully setting up the environment and testing it according to the tutorial, the script did not perform the analysis or produce any output (just create some directory in output directory) when I ran it officially.
Could you please let me know if I missed anything or if there are any configuration errors?
`bash ./run.all.sh
Here is the content of my configuration file: init.sh: `cat init.sh
!/usr/bin/env bash
set -e
if [ -z $HP_SDIR ] ; then echo "Variable HP_SDIR not defined. Make sure you followed the SETUP ENVIRONMENT instructions" ; fi
##############################################################################################################
Program that setups the environmnet
Variable HP_SDIR must be pre-set !!!
##############################################################################################################
DIRECTORY PATHS
export HP_HDIR=
readlink -f $HP_SDIR/..
#HP home directory export HP_BDIR=$HP_HDIR/bin/ #bin directory export HP_JDIR=$HP_HDIR/java/ #java directoryHuman
export HP_RDIR=$HP_HDIR/RefSeq/ #reference directory
Mouse
export HP_RDIR=$HP_HDIR/RefSeqMouse/ #Mouse reference directory
###############################################################
SOFTWARE PATH
export PATH=$HP_SDIR:$HP_BDIR:$PATH
################################################################
ALIGNMNET REFERENCE
hs38DH(default)
export HP_RNAME=hs38DH export HP_RMT=chrM export HP_RNUMT="chr1:629084-634672 chr17:22521208-22521639" # 150bp reads chr2:32916199-32916630
export HP_RNUMT="chr1:629080-634925 chr1:76971223-76971280 chr2:148881723-148881858 chr5:80651184-80651597 chr11:10508892-10509738 chr13:109424123-109424381 chr17:22521208-22521639" # 100bp reads
export HP_RCOUNT=3366 # 195(hs38DH-no_alt); 194 (hs38DH-no_alt_EBV) export HP_RURL=ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/GRCh38_reference_genome/GRCh38_full_analysis_set_plus_decoy_hla.fa
hg19
export HP_RNAME=hg19
export HP_RMT=chrM
export HP_RNUMT="chr1:564465-569708 chr17:22020692-22020827"
export HP_RCOUNT=93
export HP_RURL=http://hgdownload.cse.ucsc.edu/goldenpath/hg19/bigZips/hg19.fa.gz
CHM13
export HP_RNAME=chm13
export HP_RMT=chrM
export HP_RNUMT="chr5:81136887-81137073 chr11:10594619-10594811 chr13:12167063-12168420 chr17:23209322-23209958"
export HP_RCOUNT=24
export HP_RURL=https://s3-us-west-2.amazonaws.com/human-pangenomics/T2T/CHM13/assemblies/chm13.draft_v1.1.fasta.gz
CHM13.v2; to compute NUMT regions
export HP_RURL=https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/009/914/755/GCF_009914755.1_T2T-CHM13v2.0/GCF_009914755.1_T2T-CHM13v2.0_genomic.fna.gz
Mouse
export HP_RNAME=mm39
export HP_RMT=chrM
export HP_RCOUNT=61
export HP_RNUMT="chr1:24650615-24655265 chr2:22477298-22480547 chr10:95913385-95913756 chr12:97028201-97028567 chr13:85274752-85275567"
export HP_RURL="https://hgdownload.soe.ucsc.edu/goldenPath/mm39/bigZips/mm39.fa.gz"
###############################################################
export HP_E=300 # extension(circularization)
################################################################
GENOME REFERENCES
export HP_O=Human # organism: Human, Mouse... export HP_MT=chrM # chrM, rCRS or RSRS, FASTA file available under $HP_RDIR export HP_MTC=chrMC export HP_MTR=chrMR export HP_MTLEN=16569 export HP_NUMT=NUMT # NUMT FASTA file under $HP_RDIR
Mouse
export HP_O=Mouse # organism: Human, Mouse...
export HP_MT=chrM # chrM, rCRS or RSRS, FASTA file available under $HP_RDIR
export HP_MTC=chrMC
export HP_MTR=chrMR
export HP_MTLEN=16299
export HP_NUMT=NUMT # NUMT FASTA file under $HP_RDIR
################################################################
OTHER
export HP_CN=1 # do compute mtDNA copy number export HP_L=222000 # number of MT reads to subsample; empty: no subsampling; 222000 150bp reads => ~2000x MT coverage export HP_FOPT="-q 15 -e 0" # FASTP options: Ex: " -q 20 -e 30 "; -q: min base quality; -e: avg quality thold export HP_DOPT="--removeDups" # samblaster option; leave empty if no deduplication should be done export HP_GOPT="--mitochondria-mode" # gatk mutect2 additional options : Ex "-max-reads-per-alignment-start 50" , "--mitochondria-mode"
export HP_M=mutect2 # SNV caller: mutect2,mutserve or freebayes export HP_I=2 # number of SNV iterations : 0,1,2
0: compute read counts,mtDNA-CN
export HP_T1=03 # heteroplasmy tholds export HP_T2=05 export HP_T3=10
export HP_DP=50 # minimum coverage; lowered from 100 to 50 export HP_V=gridss # SV caller: gridssexport HP_DP=100 # minimum coverage: Ex 100
export HP_FRULE="perl -ane 'print unless(/strict_strand|strand_bias|base_qual|map_qual|weak_evidence|slippage|position|Homopolymer/ and /:0.[01234]\d+$/);' | bcftools filter -e 'DP<$HP_DP'" # filter rule (or just "tee")
export HP_P=10 # number of processors export HP_MM="30G" # maximum memory export HP_JOPT="-Xms$HP_MM -Xmx$HP_MM -XX:ParallelGCThreads=$HP_P" # JAVA options ################################################################
INPUT/OUTPUT
PWD=
pwd -P
export HP_FDIR=$PWD/f # fastq input file directory ; .fq or .fq.gz file extension
export HP_ADIR=/mnt/data/WGS_sun/5.gatk/bqsr # bams or crams input file directory export HP_ODIR=$PWD/out/ # output dir export HP_IN=$PWD/in.txt # input file to be generated `
run.all.sh: `cat run.all.sh
!/usr/bin/env bash
set -eux
export HP_SDIR=/mnt/data/hbyuan/application/MitoHPC/MitoHPC/scripts export HP_BDIR=/mnt/data/hbyuan/application/MitoHPC/MitoHPC/bin/ export HP_JDIR=/mnt/data/hbyuan/application/MitoHPC/MitoHPC/java/ export HP_RDIR=/mnt/data/hbyuan/application/MitoHPC/MitoHPC/RefSeq/ export HP_ODIR=/mnt/data/hbyuan/project/WGS_TLS/out/ export HP_RNAME=hs38DH export HP_RMT=chrM export HP_RNUMT="chr1:629084-634672 chr17:22521208-22521639" export HP_RCOUNT=3366 export HP_RURL="ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/GRCh38_reference_genome/GRCh38_full_analysis_set_plus_decoy_hla.fa" export HP_O=Human export HP_MT=chrM export HP_MTR=chrMR export HP_MTC=chrMC export HP_MTLEN=16569 export HP_NUMT=NUMT export HP_CN=1 export HP_E=300 export HP_L=222000 export HP_IN=/mnt/data/hbyuan/project/WGS_TLS/in.txt export HP_ODIR=/mnt/data/hbyuan/project/WGS_TLS/out/ export HP_M=mutect2 export HP_I=2 export HP_T1=03 export HP_T2=05 export HP_T3=10 export HP_V=gridss export HP_DP=50 export HP_FRULE="perl -ane 'print unless(/strict_strand|strand_bias|base_qual|map_qual|weak_evidence|slippage|position|Homopolymer/ and /:0.[01234]\d+$/);' | bcftools filter -e 'DP<50'" export HP_FOPT="-q 15 -e 0" export HP_DOPT="--removeDups" export HP_GOPT="--mitochondria-mode" export HP_JOPT="-Xms30G -Xmx30G -XX:ParallelGCThreads=10" export HP_MM="30G" export HP_P=10 export PATH=/mnt/data/hbyuan/application/MitoHPC/MitoHPC/scripts:/mnt/data/hbyuan/application/MitoHPC/MitoHPC/bin/:$PATH
bash /mnt/data/hbyuan/application/MitoHPC/MitoHPC/scripts/filter.sh hanwei_bqsr /mnt/data/UKB/WGS_sun/5.gatk/bqsr/hanwei_bqsr.bam /mnt/data/hbyuan/project/WGS_TLS/out//hanwei_bqsr/hanwei_bqsr
bash /mnt/data/hbyuan/application/MitoHPC/MitoHPC/scripts/getSummary.sh /mnt/data/hbyuan/project/WGS_TLS/out/ `