CWL-defined pipeline for using IDR to produce a set of peaks given two replicate eCLIP peaks
module load eclipidrmergepeaks/0.1.0
)cpan install Statistics::Basic
cpan install Statistics::Distributions
cpan install Statistics::R
(see the examples/merge_peaks_1input.yaml or examples/merge_peaks_2inputs.yaml manifest file for a full example). Below is a description of all fields required to be filled out in the manifest file:
First, use the example template to fill out the names and paths pertaining to your samples. The shebang "#!" line will depend on your experimental setup (either 2 replicates with 2 corresponding inputs, or 2 replicates normalized over 1 input).
This should match what was used to call CLIPper peaks.
species: hg19
BAM file containing the merged-barcode (read 2 only) PCR-deduped CLIP reads mapping to the genome for Replicate 1. Replace "rep1" with a unique ID for each rep1.
- name: "rep1"
ip_bam:
class: File
path: /home/centos/peCLIP_inputs/ENCFF994WPX.r2.bam
BAM file containing the merged-barcode (read 2 only) PCR-deduped INPUT reads mapping to the genome for Replicate 1.
input_bam:
class: File
path: /home/centos/peCLIP_inputs/ENCFF590UCY.r2.bam
BED file containing the called peak clusters for Replicate 1 Output from either CLIPPER or input-normed peaks. This pipeline will perform input norm internally for you, so it won't really matter which file you use.
peak_clusters:
class: File
path: /home/centos/peCLIP_inputs/ENCFF639MYI.bed6
BAM file containing the merged-barcode (read 2 only) PCR-deduped CLIP reads mapping to the genome for Replicate 2. Replace "rep2" with a unique ID for each rep2.
- name: "rep2"
ip_bam:
class: File
path: /home/centos/peCLIP_inputs/ENCFF154BQS.r2.bam
BAM file containing the merged-barcode (read 2 only) PCR-deduped INPUT reads mapping to the genome for Replicate 2.
input_bam:
class: File
path: /home/centos/peCLIP_inputs/ENCFF590UCY.r2.bam
BED file containing the called peak clusters for Replicate 2 Output from CLIPPER or input-normed peaks. This pipeline will perform input norm internally for you.
peak_clusters:
class: File
path: /home/centos/peCLIP_inputs/ENCFF664WCU.bed6
Merged reproducible peaks are reported as:
rep1.vs.rep2.bed
Where rep1
and rep2
are the user-defined names in the manifest.
./merge_peaks_2inputs.yaml
create_environment_merge_peaks.sh
will install perl v5.22.0, which is different from the version tested on TSCC
(5.10.1). Since 5.18, there have been slight changes resulting in hash keys
being accessed in a non-deterministic way. Installing 5.22.0 will result in
minor changes from the reference, but will otherwise give similar outputs.
Included is a script run_perlbrew_perl5.10.1.sh
which will attempt to
install perl 5.10.1, which will give you deterministic results.