QihuangZhang / CeLEry

CeLEry: cell location recovery in single-cell RNA sequencing
MIT License
26 stars 3 forks source link

CeLEry

Leveraging spatial transcriptomics data to recover cell locationsin single-cell RNA-seq with CeLEry

Qihuang Zhang, Shunzhou Jiang, Amelia Schroeder, Jian Hu, Kejie Li, Baohong Zhang, David Dai, Edward B. Lee, Rui Xiao, Mingyao Li*

Single-cell RNA sequencing (scRNA-seq) has transformed our understanding of cellular heterogeneity in health and disease, but the lack of physical relationships among dissociated cells has limited its applications. Here we present CeLEry, a supervised deep learning algorithm to recover the spatial origins of cells in scRNA-seq by leveraging gene expression and spatial location information learned from spatial transcriptomics (ST) data. CeLEry has a data augmentation procedure via variational autoencoder to improve the robustness of the method and overcome noise in scRNA-seq. CeLEry can infer the spatial origins of cells in scRNA-seq at multiple levels, including 2D location as well as the spatial domain or tissue layer of a cell. CeLEry also provides uncertainty estimates for the recovered location information. Comprehensive evaluations on multiple datasets generated from mouse and human brains show that CeLEry can reliably recover the spatial location information for cells in scRNA-seq.

CeLEry workflow

The implmentation procedure of CeLEry:

Usage

The CeLEry package is an implementation of a deep neural network in discovering location information for single cell RNA data. With CeLEry, you can:

Tutorial

A Jupyter Notebook of the tutorial is accessible from :
https://github.com/QihuangZhang/CeLEry/blob/main/tutorial/tutorial.md

The tutorial of the Biogen pretrain model can be accessible from :
https://github.com/QihuangZhang/CeLEry/blob/main/tutorial/BiogenPretrain.md

System Requirements

Hardware Requirements

The CeLEry package requires only a standard computer with enough RAM to support the operations defined by a user. For minimal performance, this will be a computer with about 2 GB of RAM. For optimal performance, we recommend a computer with the following specs:

RAM: 16+ GB
CPU: 4+ cores, 3.3+ GHz/core

Software Requirements

OS Requirements

The package development version is tested on Linux operating systems. The developmental version of the package has been tested on the following systems:

Linux: kernel 3.10.0 Mac OSX:
Windows:

System Requirements

Python (>3.8) support packages: torch>=1.8, pandas>=1.4, numpy>=1.20, scipy, tqdm, scanpy>=1.5, anndata, sklearn, scikit-image

Install packages

In the command, input

pip install CeLEryPy

The installation of CeLEry python package takes approximately 5 minumtes.