greenelab / deep-review

A collaboratively written review paper on deep learning, genomics, and precision medicine
https://greenelab.github.io/deep-review/
Other
1.25k stars 271 forks source link

Predicting DNA Methylation State of CpG Dinucleotide Using Genome Topological Features and Deep Networks #68

Open agitter opened 8 years ago

agitter commented 8 years ago

http://doi.org/10.1038/srep19598

gwaybio commented 8 years ago

A deep learning model that uses stacked denoising autoencoders (sDA) to pretrain weights and optimize architecture and a multilayer perceptron (MLP) to predict whether a given CpG is hypo- or hyper-methylated.

Biology

Computation

Why should we include it in our review

I am concerned about several aspects of the study. First, the engineered features could probably be designed more carefully and second, the MLP is trained to predict the output of the first algorithm. The latter leaves me wondering if the MLP is keying in on something biasing the sDA. I was also confused about exactly what the features were and how performance was evaluated.

While I think the paper fell short in these ways it could be discussed as part of learning epigenomic features and integrating 3-dimensional genomic features. I would also say it could be talked about when discussing using unsupervised algorithms for automatic feature construction, but this was not really done here.

agitter commented 7 years ago

@gwaygenomics I am debating whether we need a separate section on methylation in the Study section. Based on your comments here, I'm thinking we do not. Do you agree?

gwaybio commented 7 years ago

@agitter yes, that sounds reasonable. Perhaps including methylation in the "related epigenomic tasks" section is sufficient.

I also think in a lot of ways DNA methylation data is expanding faster than any other genomic platform. See Illumina's new EPIC Beadchip and other newer technologies including hydroxymethylcytosine analyses. Could be a lot of room for deep learning applications! :smile_cat: