Open agitter opened 8 years ago
A deep learning model that uses stacked denoising autoencoders (sDA) to pretrain weights and optimize architecture and a multilayer perceptron (MLP) to predict whether a given CpG is hypo- or hyper-methylated.
I am concerned about several aspects of the study. First, the engineered features could probably be designed more carefully and second, the MLP is trained to predict the output of the first algorithm. The latter leaves me wondering if the MLP is keying in on something biasing the sDA. I was also confused about exactly what the features were and how performance was evaluated.
While I think the paper fell short in these ways it could be discussed as part of learning epigenomic features and integrating 3-dimensional genomic features. I would also say it could be talked about when discussing using unsupervised algorithms for automatic feature construction, but this was not really done here.
@gwaygenomics I am debating whether we need a separate section on methylation in the Study section. Based on your comments here, I'm thinking we do not. Do you agree?
@agitter yes, that sounds reasonable. Perhaps including methylation in the "related epigenomic tasks" section is sufficient.
I also think in a lot of ways DNA methylation data is expanding faster than any other genomic platform. See Illumina's new EPIC Beadchip and other newer technologies including hydroxymethylcytosine analyses. Could be a lot of room for deep learning applications! :smile_cat:
http://doi.org/10.1038/srep19598