lcdb / lcdb-wf

Robust, tested workflows for RNA-seq, ChIP-seq and other high-throughput sequencing analysis
https://lcdb.github.io/lcdb-wf
20 stars 17 forks source link

consider using black/greylists for peak-calling #381

Open daler opened 1 year ago

daler commented 1 year ago

The GreyListChIP package describes the algorithm, which seems straightforward. It may or may not make sense to reimplement into a snakemake rule using other tools already installed in the environment. Specifically, it does this:

  1. generating a tiling of the genome,
  2. counting reads from a BAM file for the tiling,
  3. sampling from the counts and fitting the samples to the negative binomial distribution to calculate the read count threshold,
  4. filtering the tiling to identify regions of high signal, then
  5. exporting the resulting set to a bed file.