Open mtxellrb opened 2 years ago
Yes, for now the README is focused on motifs. For tracks, the script still needs to be written, but conceptually it is quite similar to https://github.com/aertslab/create_cisTarget_databases/blob/master/create_cistarget_motif_databases.py. Instead of scoring motifs with Cluster-buster on a FASTA file with your regions/genes of interest, you need a BED file with your regions/genes and use bigWigAverageOverBed to get the max score per region and rank those. I might look at this code soon, as I have to generate some databases myself.
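To make the "max score per region" step concrete, here is a rough Python sketch that reads a bigWigAverageOverBed output table and keeps the maximum per region. It assumes the tool was run with its -minMax option, so that the tab-separated columns are name, size, covered, sum, mean0, mean, min, max. The function name read_max_scores and the sample rows are made up for illustration and are not part of the repository.

```python
import csv
import io


def read_max_scores(handle):
    """Return {region_name: max_score} from a bigWigAverageOverBed table.

    Assumed column layout (tool run with -minMax):
    name, size, covered, sum, mean0, mean, min, max
    """
    scores = {}
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue  # skip blank lines
        region_name = row[0]
        scores[region_name] = float(row[7])  # 8th column: max signal in region
    return scores


# Hypothetical two-region table, only to show the expected input shape.
example_table = (
    "regionA\t100\t100\t250.0\t2.5\t2.5\t0.0\t7.5\n"
    "regionB\t50\t40\t80.0\t1.6\t2.0\t1.0\t3.0\n"
)
max_scores = read_max_scores(io.StringIO(example_table))
print(max_scores)
```

One such table per bigWig track would give one score column per track; ranking the regions within each column is then a separate step.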
@ghuls I am also trying to understand whether I can use peak files generated from my ChIP-seq data!
Yes, you can, but make sure you have a lot of ChIP-seq tracks in your database, as otherwise they will always be enriched in each analysis. For a cisTarget database you just need some input data that you can rank. Also make sure that, in case of ties, you randomize the rank assignment so you don't get artificially high rankings for your first regions.
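As an illustration of the tie handling, here is a minimal Python sketch that ranks regions by score (highest first) and breaks ties in random order instead of input order. The function name rank_with_random_ties, the 0-based ranks and the example scores are assumptions for this sketch, not the actual implementation in the repository.

```python
import random


def rank_with_random_ties(scores, seed=None):
    """Rank regions by descending score; tied regions get a random order.

    scores: dict mapping region name to score.
    Returns a dict mapping region name to a 0-based rank (0 = best).
    """
    rng = random.Random(seed)
    # Sort on (-score, random key): the random key only matters when
    # two regions have exactly the same score.
    keyed = [(-score, rng.random(), region) for region, score in scores.items()]
    keyed.sort()
    return {region: rank for rank, (_, _, region) in enumerate(keyed)}


# Hypothetical scores: regionB and regionC are tied.
example_scores = {"regionA": 5.0, "regionB": 1.0, "regionC": 1.0, "regionD": 0.0}
ranks = rank_with_random_ties(example_scores, seed=1)
print(ranks)
```

Without the random key, a plain stable sort would always put the tied region that appears first in the BED file ahead of the others, which is the artificial advantage described above. This matters most for sparse tracks, where many regions share a score of zero.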
@mtxellrb
A script for creating a track database from bigWig TF ChIP-seq data has now been added: create_cistarget_track_databases.py
https://github.com/aertslab/create_cisTarget_databases#create_cistarget_track_databasespy
Hi,
Maybe I'm getting this totally wrong, but from the README file it seems that, to annotate regulatory regions for each gene or region, you can either use a motif annotation generated by Cluster-buster or use ChIP-seq tracks instead. However, the description seems to be focused entirely on motif annotation. Could you be so kind as to provide me with an example pipeline for bigWig files of TF ChIP-seq data and gene FASTA files? Thanks!
Best,
Meritxell