Open agitter opened 7 years ago
The authors train two variational autoencoders (VAEs) to model cancer cell lines reaction and susceptibility to drug treatments. One of the VAEs includes a semi-supervised component that predicts treatment response. The paper presents the models and data effectively while adding several valuable computational evaluations, but the paper lacks biological interpretation. I am also not entirely sold that their reported results support one of their conclusions that doing well on reconstruction does not lead to classification improvements (although I tend to agree that reconstruction and biological meaning are not monotonic). There is no available source code as far as I can see.
The models are trained using ~600 cell line gene expression profiles using measurements for 903 landmark genes. The cell lines are measured before and after treatment by 19 drugs. Models were trained independently for each drug.
Two VAE models train with a combination of SGD and Inverse Autoregressive Flow. Both learn two distinct but linearly-related latent representations for pre and post drug perturbation. The first model, Perturbation VAE
learns to encode the response to drug treatment, while the related Drug Response VAE
or "Dr.VAE" is the semi-supervised extension that predicts drug mediated cell killing.
https://arxiv.org/abs/1706.08203
Bonus points for the title.