[results] "Weights" of target genes in regulons

aertslab / pySCENIC

pySCENIC is a lightning-fast python implementation of the SCENIC pipeline (Single-Cell rEgulatory Network Inference and Clustering) which enables biologists to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data.

GNU General Public License v3.0

440 stars 182 forks source link

Hi @Annika18 ,

Sorry for the delay in response. Yes, the weights for the target genes come from the importance scores from the GRN step. I think that parts of this comment will answer your questions, but feel free to ask if something isn't clear.

The scores are probably best explained in 10.1038/s41596-020-0336-2 (Box 2 within):

Given the pre-calculated whole-genome rankings for a comprehensive motif collection, motif discovery for a given set of genes as input (typically referred to as a gene signature) involves scanning the database for rankings in which the top-ranked fraction is enriched for this input set of genes. More specifically, the cumulative recovery of the foreground set in a whole-genome ranking is quantified using an AUC statistic. The AUC values are standardized (i.e., by mean subtraction and scaling by the standard deviation) and expressed as NESs. Motifs associated with an NES >3.0 are considered as enriched for the supplied signature. This corresponds to a FDR of 3–9% (ref. 13).

aertslab / pySCENIC

[results] "Weights" of target genes in regulons #190