theislab / chemCPA

Code for "Predicting Cellular Responses to Novel Drug Perturbations at a Single-Cell Resolution", NeurIPS 2022.
https://arxiv.org/abs/2204.13545
MIT License
97 stars 24 forks source link

which molecular embedding was ultimately chosen? #143

Closed bhomass closed 11 months ago

bhomass commented 1 year ago

Out of the many drug embedding methods, which was behind you published results in fig 2 and 4?

MxMstrmn commented 1 year ago

I am citing this passage from the "Perturbation Network" section of the paper:

The molecule encoder G can be any encoding network that maps molecular representations to fixed-size embeddings. Due to the limited number of drugs available in scRNA-seq HTSs, we propose to rely on a pretrained encoding model and freeze G during training. We tested multiple different options for G and include a detailed benchmark in the Appendix A.1. We found that RDKit features performed well in our setting and report all following results for chemCPA with RDKit as the molecule encoder G.

bhomass commented 1 year ago

Thank you!