arnavmdas / epiphany

MIT License
10 stars 2 forks source link

Epiphany

Epiphany: predicting Hi-C contact maps from 1D epigenomic signals

Epiphany, a neural network to predict cell-type-specific Hi-C contact maps from widely available epigenomic tracks. Epiphany uses bidirectional long short-term memory layers to capture long-range dependencies and optionally a generative adversarial network architecture to encourage contact map realism and sharpness. Epiphany shows excellent generalization to held-out chromosomes within and across cell types, yields accurate TAD and interaction calls, and predicts structural changes caused by perturbations of epigenomic signals.

Model Input and Output

Epiphany is creating a connection between 1D epigenomic signals and the 3D chromatin structure, enabling the interpretation of feature importance of epigenomic signals from specific tracks in relation to structural changes. Any combination of epigenomic tracks can be used as input. Through our ablation analysis, we found that using a two-track combination (ATAC + CTCF) along yields commendable prediction quality. Furthermore, incorporating ATAC or CTCF in conjunction with other relevant epigenomic tracks as the input set significantly enhances the predictive capabilities.

Roadmap

This repo includes scripts and related files for the Epiphany model [preprint].

Resource repo: Zenodo DOI

Quick start training

Clone Repository

git clone https://github.com/arnavmdas/epiphany.git

Training

Move to training directory

cd epiphany/epiphany

Download dataset from google drive

mkdir ./Epiphany_dataset
cd ./Epiphany_dataset
wget --no-check-certificate https://drive.google.com/drive/u/2/folders/1UJX6cp-4s0Jbud9jovzuaqnBeORg5R8x -O GM12878_X.h5
wget --no-check-certificate https://drive.google.com/drive/u/2/folders/1UJX6cp-4s0Jbud9jovzuaqnBeORg5R8x -O GM12878_y.pickle
cd ..

Run training script

python3 adversarial.py --wandb

Prediction using pretrained models

Contact

If you have any questions, please feel free to contact Rui Yang (ruy4001@med.cornell.edu), Arnav Das (arnavmd2@uw.edu).