The scGPS package website is available at: https://imb-computational-genomics-lab.github.io/scGPS/index.html
The usage instruction can be found at: https://imb-computational-genomics-lab.github.io/scGPS/articles/vignette.html
scGPS is a complete single cell RNA analysis framework from decomposing a mixed population into clusters (SCORE) to analysing the relationship between clusters (scGPS). scGPS also performs unsupervised selection of predictive genes defining a subpopulation and/or driving transition between subpopulations.
The package implements two new algorithms SCORE and scGPS.
Key features of the SCORE clustering algorithm
Key features of the scGPS algorithm
scGPS takes scRNA expression dataset(s) from one or more unknown sample(s) to find subpopulations and relationship between these subpopulations. The input dataset(s) contains mixed, heterogeous cells. scGPS first uses SCORE (or CORE V2.0) to identify homogenous subpopulations. scGPS contains a number of functions to verify the subpopulations identified by SCORE (e.g. functions to compare with results from PCA, tSNE and the imputation method CIDR). scGPS also has options to find gene markers that distinguish a subpopulation from the remaining cells and performs pathway enrichment analysis to annotate subpopulation. In the second stage, scGPS applies a machine learning procedure to select optimal gene predictors and to build prediction models that can estimate between-subpopulation transition scores, which are the probability of cells from one subpopulation that can likely transition to the other subpopulation.
Figure 1. scGPS workflow. Yellow boxes show inputs, and green boxes show main scGPS analysis.