theislab / chemCPA

Code for "Predicting Cellular Responses to Novel Drug Perturbations at a Single-Cell Resolution", NeurIPS 2022.
https://arxiv.org/abs/2204.13545
MIT License
88 stars 23 forks source link

LINCS Trapnell gene overlap #55

Closed siboehm closed 2 years ago

siboehm commented 2 years ago

I started writing the code that does the transfer learning (loads the model pretrained on LINCS for finetuning on Trapnell) and ran into a stumbling block: LINCS has 978 genes, one of which isn't part of Trapnell. Further the ordering of the genes betw. lincs_full_smiles.h5ad and trapnell_cpa.h5ad is completely different.

Hence:

I think generating & storing the datasets is less error prone than trying to fix this in the code.

Can you do this @MxMstrmn? We can also talk about it tmrw.

siboehm commented 2 years ago

Closed by #58