Closed fbrundu closed 4 years ago
Hi @fbrundu ,
It should be fine to run all of your cells together. Even if a TF is missing in some cells it should still be detected in the analysis (if it passes the pruning thesholds, of course). This is how pySCENIC works normally anyway -- it will pick up regulons that are present in each of potentially many cell types within a heterogenous dataset.
Ok thanks @cflerin !
I have a scRNA-Seq dataset composed of 33k cells of which roughly half of the cells come from Case samples (samples having a specific disease genotype) and the other half come from healthy Controls. We are currently working with a model in which the Case genotype has some deletions that might target some TFs, i.e. the regions that contain such TFs are lost in the Case. Should the inference of regulons be run only on Controls and the AUC scoring on all cells? Or should both the inference of regulons and scoring be run always on the full set of cells?
Thanks!