Closed badusername21 closed 2 years ago
Hey @badusername21,
Have you tested this again with a more recent release? I'm sorry this slipped past me here. Typically that step is quite rapid, seconds as opposed to minutes. If you are still troubleshooting this late in the game give it a go with the most recent release and see if the issue persists. Happy to reopen the issue if the case.
Previously, we had not been using the TSS enrichment process while processing our samples, but after introducing this aspect into the processing, it appears as though whenever we run a sample it stops at the TSS enrichment part and does not appear to move on from it even after giving it a couple days for a single sample.
Pipeline run code and environment:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/pepatac/pipelines/pepatac.py --sample-name 305A --genome GRCm38_with_viruses_and_spikes --input ../ATAC/305A_L000_R1_001.fastq.gz --single-or-paired paired -O ./new_processed/results_pipeline -P 16 -M 16000 --input2 ../ATAC/305A_L000_R2_001.fastq.gz --aligner bwa --genome-size 1.87e9 --trimmer skewer --deduplicator samblaster --peak-type fixed --extend 250 --frip-ref-peaks None
Version log:
/varidata/research/software/BBC/F-Seq2/build/lib/python3.8/site-packages/pypiper
/varidata/research/projects/immunograph/FASTQs/pepatac_results/pepatac/pipelines
Arguments passed to pipeline:
TSS_name
:None
aligner
:bwa
anno_name
:None
blacklist
:None
config_file
:pepatac.yaml
cores
:16
deduplicator
:samblaster
dirty
:False
extend
:250
force_follow
:False
frip_ref_peaks
:None
genome_assembly
:GRCm38_with_viruses_and_spikes
genome_size
:1.87e9
input
:['../ATAC/305A_L000_R1_001.fastq.gz']
input2
:['../ATAC/305A_L000_R2_001.fastq.gz']
keep
:False
lite
:False
logdev
:False
mem
:16000
motif
:False
new_start
:False
no_fifo
:False
no_scale
:False
output_parent
:./new_processed/results_pipeline
paired_end
:True
peak_caller
:macs2
peak_type
:fixed
prealignments
:[]
prioritize
:False
recover
:False
sample_name
:305A
silent
:False
single_or_paired
:paired
skipqc
:False
sob
:False
testmode
:False
trimmer
:skewer
verbosity
:None
Local input file: ../ATAC/305A_L000_R1_001.fastq.gz Local input file: ../ATAC/305A_L000_R2_001.fastq.gz
Merge/link and fastq conversion: (06-24 13:11:55) elapsed: 0.0 TIME
Number of input file sets: 2 Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/raw/305A_R1.fastq.gz
Local input file: '/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/raw/305A_R1.fastq.gz' Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/raw/305A_R2.fastq.gz
Local input file: '/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/raw/305A_R2.fastq.gz' Found .fastq.gz file Found .fq.gz file; no conversion necessary Found .fastq.gz file Found .fq.gz file; no conversion necessary Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/fastq/305A_R1.fastq.gz
,/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/fastq/305A_R2.fastq.gz
Adapter trimming: (06-24 13:12:21) elapsed: 26.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/fastq/305A_R1_trim.fastq
Thu Jun 24 13:12:55 2021[0;32m >> done[0m (34.586s) 9981583 read pairs processed; of these: 87 ( 0.00%) short read pairs filtered out after trimming by size control 67 ( 0.00%) empty read pairs filtered out after trimming by size control 9981429 (100.00%) read pairs available; of these: 910421 ( 9.12%) trimmed read pairs available after processing 9071008 (90.88%) untrimmed read pairs available after processing log has been saved to "/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/fastq/305A-trimmed.log". Command completed. Elapsed time: 0:00:35. Running peak memory: 0.026GB.
PID: 120787; Command: skewer; Return code: 0; Memory used: 0.026GB
Evaluating read trimming
Prealignments (06-24 13:13:52) elapsed: 91.0 TIME
You may use
--prealignments
to align to references before the genome alignment step. See docs.Compress all unmapped read files (06-24 13:13:52) elapsed: 0.0 TIME
Map to genome (06-24 13:13:52) elapsed: 0.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_sort_dedup.bam
Missing stat 'Mitochondrial_reads'
Calculate NRF, PBC1, and PBC2 (06-24 13:23:08) elapsed: 556.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_bamQC.tsv
Remove duplicates and produce signal tracks (06-24 13:23:20) elapsed: 12.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_sort_dedup.bam
Calculate distribution of reads across nucleosomes (06-24 13:24:50) elapsed: 91.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_NFR.bam
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_mono.bam
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_di.bam
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_tri.bam
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_poly.bam
Calculate library complexity (06-24 13:26:58) elapsed: 128.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_preseq_out.txt
sample size: 1000000 sample size: 2000000 sample size: 3000000 sample size: 4000000 sample size: 5000000 sample size: 6000000 sample size: 7000000 sample size: 8000000 Command completed. Elapsed time: 0:00:41. Running peak memory: 13.199GB.
PID: 121903; Command: preseq; Return code: 0; Memory used: 0.004GB
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_preseq_yield.txt
[ESTIMATING YIELD CURVE] [BOOTSTRAPPING HISTOGRAM] ..._............._............_................... .........................................__... ......... [COMPUTING CONFIDENCE INTERVALS] [WRITING OUTPUT] Command completed. Elapsed time: 0:00:42. Running peak memory: 13.199GB.
PID: 121934; Command: preseq; Return code: 0; Memory used: 0.004GB
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_preseq_counts.txt
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_preseq_plot.pdf
,/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_preseq_plot.png
Library complexity plot completed!
Command completed. Elapsed time: 0:00:07. Running peak memory: 13.199GB.
PID: 121977; Command: Rscript; Return code: 0; Memory used: 0.21GB
Produce signal tracks (06-24 13:28:46) elapsed: 108.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes_exact/305A_exact.bw
,/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/aligned_GRCm38_with_viruses_and_spikes/305A_smooth.bw
Calculate TSS enrichment (06-24 13:33:41) elapsed: 295.0 TIME
Target to produce:
/varidata/research/projects/immunograph/FASTQs/pepatac_results/new_processed/results_pipeline/305A/QC_GRCm38_with_viruses_and_spikes/305A_TSS_enrichment.txt