awesome-deep-learning-single-cell-papers
This repository keeps track of the latest papers on single-cell analysis with deep learning methods. We categorize them based on individual tasks.
We will try to keep this list up to date. If you find an error or a missing paper, please don't hesitate to open an issue or pull request.
Citation
Feel free to cite our comprehensive survey paper on Deep Learning in Single-cell Analysis:
@article{molho2024deep,
title={Deep learning in single-cell analysis},
author={Molho, Dylan and Ding, Jiayuan and Tang, Wenzhuo and Li, Zhaoheng and Wen, Hongzhi and Wang, Yixin and Venegas, Julian and Jin, Wei and Liu, Renming and Su, Runze and others},
journal={ACM Transactions on Intelligent Systems and Technology},
volume={15},
number={3},
pages={1--62},
year={2024},
publisher={ACM New York, NY}
}
More papers on foundation models for single-cell analysis are recorded [HERE].
Book
- [Single Cell Best Practices], Fabian Theis's Lab
- [Basics of Single-Cell Analysis with Bioconductor], Bioconductor software based on R
Single Cell Technology
Single-Modality
Multimodality
Spatial Transcriptomic
- [2022 Nature Methods] Museum of spatial transcriptomics [paper]
Course
- [CSCI 1850 Deep Learning in Genomics], Brown University
- [Machine Learning in Genomics: Dissecting Human Disease Circuitry], MIT
- [ANALYSIS OF SINGLE CELL RNA-SEQ DATA], course by Orr Ashenberg, Dana Silverbush, Kirk Gosik
- [Analysis of single cell RNA-seq data, www.singlecellcourse.org] - step-by-step scRNA-seq analysis course. R-based, with code examples, explanations, exercises. From alignment (STAR) and QC (FASTQC) to introduction to R, SingleCellExperiment class, scater object, data exploration (reads, UMI), filtering, normalization (scran), batch effect removal (RUV, ComBat, mnnCorrect, GLM, Harmony), clustering and marker gene identification (SINCERA, SC3, tSNE, Seurat), feature selection (M3Drop::M3DropConvertData, BrenneckeGetVariableGenes), pseudotime analysis (TSCAN, Monocle, diffusion maps, SLICER, Ouija, destiny), imputation (scImpute, DrImpute, MAGIC), differential expression (Kolmogorov-Smirnov, Wilcoxon, edgeR, Monocle, MAST), data integration (scmap, cell-to-cell mapping, Metaneighbour, mnnCorrect, Seurat's canonical correlation analysis). Search for scRNA-seq data (scfind R package), as well as Hemberg group’s public datasets. Seurat chapter. "Ideal" scRNA-seq pipeline. Video lectures.
Paper
Andrews, Tallulah S., Vladimir Yu Kiselev, Davis McCarthy, and Martin Hemberg. "Tutorial: Guidelines for the Computational Analysis of Single-Cell RNA Sequencing Data." https://doi.org/10.1038/s41596-020-00409-w Nature Protocols, December 7, 2020.
Survey
- [2023 Biophysics Reviews] Deep learning in spatial transcriptomics: Learning from the next next-generation sequencing [paper]
Pretrained Model or LLM or Foundation Model
For more details, refer to [foundation-model-single-cell-papers]
- [2024 BioRxiv] scPRINT: pre-training on 50 million cells allows robust gene network predictions [paper]
- [2024 ICLR] BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs [paper]
- [2023 bioRxiv] CellPLM: Pre-training of Cell Language Model Beyond Single Cells [paper]
- [2023 bioRxiv] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome [paper]
- [2023 bioRxiv] The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 [paper]
- [2023 bioRxiv] Augmenting large language models with chemistry tools [paper]
- [2023 bioRxiv] GET: a foundation model of transcription across human cell types [paper]
- [2023 bioRxiv] Cell2Sentence: Teaching Large Language Models the Language of Biology [paper]
- [2023 bioRxiv] Evaluating the Utilities of Large Language Models in Single-cell Data Analysis [paper]
- [2023 arxiv] Towards Generalist Biomedical AI [paper]
- [2023 bioRxiv] Contextualizing protein representations using deep learning on protein networks and single-cell data [paper]
- [2023 Nature] Large language models encode clinical knowledge [paper]
- [2023 Nature Methods] Towards foundation models of biological image segmentation [paper]
- [2023 bioRxiv] DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting Specific Proteins [paper]
- [2023 arxiv] Hyena Hierarchy: Towards Larger Convolutional Language Models [paper]
- [2023 bioRxiv] Population-level integration of single-cell datasets enables multi-scale analysis across samples [paper]
- [2023 bioRxiv] Large Scale Foundation Model on Single-cell Transcriptomics [paper]
- [2023 Bioinformatics] Applications of transformer-based language models in bioinformatics: a survey [paper]
- [2023 Nature] Transfer learning enables predictions in network biology [paper]
- [2023 arxiv] BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks [paper]
- [2023 arxiv] Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding [paper]
- [2023 arxiv] CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models [paper]
- [2023 iScience tGPT] Generative pretraining from large-scale transcriptomes for single-cell deciphering [paper]
- [2023 bioRxiv] GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information [paper]
- [2023 Github] OpenBioMed [Github]
- [2023 blog] BioMedLM: a Domain-Specific Large Language Model for Biomedical Text [blog]
- [2023 bioRxiv] scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI [paper]
- [2023 bioRxiv] xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data [paper]
- [2023 Nature Biotechnology] Large language models generate functional protein sequences across diverse families [paper]
- [2022 arxiv] A single-cell gene expression language model [paper]
- [2022 Briefings in Bioinformatics] BioGPT: generative pre-trained transformer for biomedical text generation and mining [paper]
- [2022 Nature Machine Intelligence] scBERT as a large-scale pretrained deep language model for cell type annotation of single-cell RNA-seq data [paper]
- [2022 bioRxiv] scFormer: a universal representation learning approach for single-cell data using transformers [paper]
- [2022 Bioinformatics] scPretrain: multi-task self-supervised learning for cell-type classification [paper]
- [2021 PNAS] Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences [paper]
- [2021 Bioinformatics] DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome [paper]
- [2021 Arxiv, 576 citations] Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing [paper]
- [2021 Arxiv, 1111 citations] Don't Stop Pretraining: Adapt Language Models to Domains and Tasks [paper]
GAN or Diffusion Model
- [2024 Brief Bioinform] stDiff: a diffusion model for imputing spatial transcriptomics through single-cell transcriptomics [paper]
- [2024 biorxiv] scDiffEq: drift-diffusion modeling of single-cell dynamics with neural stochastic differential equations [paper]
- [2024 biorxiv] scDiffusion: conditional generation of high-quality single-cell data using diffusion model [paper]
- [2024 biorxiv] In Silico Generation of Gene Expression profiles using Diffusion Models [paper]
- [2024 Cell] A programmable reaction-diffusion system for spatiotemporal cell signaling circuit design [paper]
- [2023 ICCV] Scalable Diffusion Models with Transformers [paper]
- [2023 biorxiv] From Noise to Knowledge: Probabilistic Diffusion-Based Neural Inference of Gene Regulatory Networks [paper]
- [2023 biorxiv Diffusion] A General Single-Cell Analysis Framework via Conditional Diffusion Generative Models [paper]
- [2023 biorxiv GAN] Predicting cell morphological responses to perturbations using generative modeling [paper]
- [2023 Nature Diffusion Model] AI tools are designing entirely new proteins that could transform medicine [paper]
- [2023 biorxiv Diffusion Model] The Power of Two: integrating deep diffusion models and variational autoencoders for single-cell transcriptomics analysis [paper]
- [2023 biorxiv GAN] Scalable Integration of Multiomic Single Cell Data Using Generative Adversarial Networks [paper]
- [2023 biorxiv Diffusion Model] Spontaneous breaking of symmetry in overlapping cell instance segmentation using diffusion models [paper]
Multimodal Learning
- [2024 ICLR workshop NLP+Gene Expression] Joint Embedding of Transcriptomes and Text Enables Interactive Single-Cell RNA-seq Data Exploration via Natural Language [paper]
- [2024 Nature Biotechnology Image+Gene Expression] Inferring super-resolution tissue architecture by integrating spatial transcriptomics with histology [paper]
- [2023 arxiv Image+Gene Expression] Transformer with Convolution and Graph-Node co-embedding: An accurate and interpretable vision backbone for predicting gene expressions from local histopathological image [paper]
- [2023 arxiv multimodal] MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data [paper]
- [2023 biorxiv multimodal] Pathformer: biological pathway informed Transformer model integrating multi-modal data of cancer [paper]
- [2023 biorxiv Image+Gene Expression] Spatially Resolved Gene Expression Prediction from H&E Histology Images via Bi-modal Contrastive Learning [paper]
- [2023 biorxiv Image+Gene Expression] Single-cell gene expression prediction using H&E images based on spatial transcriptomics [paper]
Data Simulation
- [2023 biorxiv] GRouNdGAN: GRN-guided simulation of single-cell RNA-seq data using causal generative adversarial networks [paper]
Interpretability
- [2021 CVPR] Transformer Interpretability Beyond Attention Visualization [paper][github]
- [2021 ICML] BERTology Meets Biology: Interpreting Attention in Protein Language Models [paper]
- [2019 ACL] A Multiscale Visualization of Attention in the Transformer Model [paper] [github]
Spatiotemporal Transcriptomic
- [2024 biorxiv] Gene Trajectory Inference for Single-cell Data by Optimal Transport Metrics [paper]
- [2023 biorxiv] Uncovering developmental time and tempo using deep learning [paper]
- [2023 biorxiv] scNODE: Generative Model for Temporal Single Cell Transcriptomic Data Prediction [paper]
- [2023 arxiv survey from CS field] Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook [paper]
- [2023 ICML Reference from CS field] Continuous Spatiotemporal Transformers [paper]
- [2023 arxiv multimodalities Reference from CS field] IMAGEBIND: One Embedding Space To Bind Them All [paper]
- [2023 arxiv multimodalities Reference from CS field] UnIVAL: Unified Model for Image, Video, Audio and Language Tasks [paper]
- [2023 arxiv multimodalities Reference from CS field] Meta-Transformer: A Unified Framework for Multimodal Learning [Meta-Transformer paper][viT vision Transformer paper][ImageGPT paper Generative Pretraining From Pixels]
- [2023 KDD Reference from CS field] Spatio-temporal Diffusion Point Processes [paper]
- [2023 arxiv Reference from CS field] Long-Range Transformers for Dynamic Spatiotemporal Forecasting [paper]
- [2023 Nature Communications] Generative modeling of single-cell time series with PRESCIENT enables prediction of cell trajectories with interventions [paper]
- [2023 bioRxiv] Mapping cells through time and space with moscot [paper]
- [2023 Nature Methods] Spatiotemporally resolved transcriptomics reveals the subcellular RNA kinetic landscape [paper]
- [2022 bioRxiv] Spateo: multidimensional spatiotemporal modeling of single-cell spatial transcriptomics [paper]
- [2022 ICLR Reference from CS field] UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning [paper]
- [2022 NeurIPS spatial-temporal single-cell -> spatial-temporal video] Flamingo: a Visual Language Model for Few-Shot Learning [paper]
- [2022 arxiv, image-gene expression contrastive learning] CoCa: Contrastive Captioners are Image-Text Foundation Models [paper]
- [2020 ICLR, image-gene expression pretraining] VL-BERT: Pre-training of Generic Visual-Linguistic Representations [paper]
- [2019 AAAI, image-gene expression pretraining] Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training [paper]
RNA Velocity
- [2023 Nature Methods] Deep generative modeling of transcriptional dynamics for RNA velocity analysis in single cells [paper]
Molecular Representation Learning
- [2023 ICLR] Uni-Mol: A Universal 3D Molecular Representation Learning Framework [paper]
Single Cell Perturbation or Drug Response
- [2024 biorxiv] Deep learning-based predictions of gene perturbation effects do not yet outperform simple linear methods [paper]
- [2024 ICLR] Biologically Interpretable VAE with Supervision for Transcriptomics Data Under Ordinal Perturbations [paper]
- [2024 Nature Methods] scPerturb: harmonized single-cell perturbation data [paper]
- [2023 biorxiv] Unagi: Deep Generative Model for Deciphering Cellular Dynamics and In-Silico Drug Discovery in Complex Diseases [paper]
- [2023 NeurIPS] Modelling Cellular Perturbations with the Sparse Additive Mechanism Shift Variational Autoencoder [paper]
- [2023 Nature Methods] Learning single-cell perturbation responses using neural optimal transport [paper]
- [2023 Nature Methods] Neural optimal transport predicts perturbation responses at the single-cell level [paper]
- [2023 Mol Syst Biol] Predicting cellular responses to complex perturbations in high-throughput screens [paper]
- [2023 biorxiv] Learning Perturbation-specific Cell Representations for Prediction of Transcriptional Response across Cellular Contexts [paper]
- [2023 Nature] Dissecting cell identity via network inference and in silico gene perturbation [paper]
- [2023 biorxiv Diffusion Model] The Power of Two: integrating deep diffusion models and variational autoencoders for single-cell transcriptomics analysis [paper]
- [2023 ICLR] Predicting Cellular Responses with Variational Causal Inference and Refined Relational Information [paper]
- [2022 arxiv] PerturbNet predicts single-cell responses to unseen chemical and genetic perturbations [paper]
- [2022 arxiv] CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data [paper]
- [2022 NeurIPS] Predicting Cellular Responses to Novel Drug Perturbations at a Single-Cell Resolution [paper]
- [2022 biorxiv] GEARS: Predicting transcriptional outcomes of novel multi-gene perturbations [paper]
- [2021 biorxiv] Learning interpretable cellular responses to complex perturbations in high-throughput screens [paper]
- [2019 Nature Methods] scGen predicts single-cell perturbation responses [paper]
Cellular Dynamics
- [2023 Genome Biology] scTour: a deep learning architecture for robust inference and accurate prediction of cellular dynamics [paper]
Single Cell Application
- [2023 medrxiv] Single-cell RNA sequencing of human tissue supports successful drug targets [paper]
- [2023 Nature Methods] Machine learning in rare disease [paper]
- [2023 Molecular Systems Biology] Single-cell biology: what does the future hold? [paper]
- [2023 Genes] Single-Cell Analysis in the Omics Era: Technologies and Applications in Cancer [paper]
- [2023 Nature Communications] ASGARD is A Single-cell Guided Pipeline to Aid Repurposing of Drugs [paper]
- [2023 Nature Reviews Clinical Oncology] Advancing CAR T cell therapy through the use of multidimensional omics data [paper]
Tools For Single Cell or Spatial Data
[Tool Summary]
- [2024 biorxiv] Scvi-hub: an actionable repository for model-driven single cell analysis [paper]
- [2024 Nature Methods] SpatialData: an open and universal data framework for spatial omics [paper]
- [2023 Nucleic Acids Research] DeepBIO: an automated and interpretable deep-learning platform for high-throughput biological sequence prediction, functional annotation and visualization analysis [paper]
- [2023 Github] SpatialTis: an ultra-fast spatial analysis toolkit for large-scale spatial single-cell data. [github]
- [2023 biorxiv] CellContrast: Reconstructing Spatial Relationships in Single-Cell RNA Sequencing Data via Deep Contrastive Learning [paper]
Single Cell Atlas
- [2023 Nature] A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain [paper]
- [2023 Nature] A spatially resolved single-cell genomic atlas of the adult human breast [paper]
- [2023 Nature Medicine] An integrated cell atlas of the lung in health and disease [paper]
- [2022 Nucleic Acids Research] Aquila: a spatial omics database and analysis platform [paper]
- [Cellxgene Datasets: 546 datasets by 2022]
- [2022 Nature Methods] Benchmarking atlas-level data integration in single-cell genomics [paper]
- [2022 bioRxiv] A unified analysis of atlas single cell data [paper]
- [2022 Nature Biotechnology] Integration of spatial and single-cell transcriptomic data elucidates mouse organogenesis [paper]
- [2022 bioRxiv] Supervised spatial inference of dissociated single-cell data with SageNet [paper]
- [2022 Nature Communications] Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space [paper]
Single Cell Visualization
- [Chanzuckerberg: An interactive explorer for single-cell transcriptomics data]
- [UCSC Cell Browser]
- [Cytoscape]
- [UCSC Xena]
- [ASAP: Automated Single-cell Analysis Pipeline]
- [GenePattern]
- [Loopy Browser]
Benchmarking
- [2023 biorxiv] Benchmarking the translational potential of spatial gene expression prediction from histology [paper]
- [2023 bioRxiv] Systematic benchmarking of imaging spatial transcriptomics platforms in FFPE tissues [paper]
- [2023 bioRxiv] Benchmarking multi-omics integration algorithms across single-cell RNA and ATAC data [paper]
- [2023 bioRxiv] BEND: Benchmarking DNA Language Models on biologically meaningful tasks [paper]
- [2023 Genome Biology] Benchmarking algorithms for joint integration of unpaired and paired single-cell RNA-seq and ATAC-seq data [paper]
- [2023 Nature Communications] A comprehensive benchmarking with practical guidelines for cellular deconvolution of spatial transcriptomics [paper]
- [2023 bioRxiv] Universal preprocessing of single-cell genomics data [paper]
- [2023 Genome Biology] Meta-analysis of (single-cell method) benchmarks reveals the need for extensibility and interoperability [paper]
- [2023 bioRxiv] Benchmarking the Autoencoder Design for Imputing Single-Cell RNA Sequencing Data [paper]
- [2023 bioRxiv] Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data [paper]
- [2022 Nature Communications] Comparison of methods and resources for cell-cell communication inference from single-cell RNA-Seq data [paper]
- [2022 Nature Methods] Benchmarking atlas-level data integration in single-cell genomics [paper]
- [2022 Nature Methods] Benchmarking spatial and single-cell transcriptomics integration methods for transcript distribution prediction and cell type deconvolution [paper]
- [2022 BioRxiv] Benchmarking Automated Cell Type Annotation Tools for Single-cell ATAC-seq Data [paper]
- [2022 Briefings in Bioinformatics] Benchmarking methods for detecting differential states between conditions from multi-subject single-cell RNA-seq data [paper]
- [2022 Nucleic Acids Research] scIMC: a platform for benchmarking comparison and visualization analysis of scRNA-seq data imputation methods [paper]
- [2021 Frontiers in Genetics] Evaluating the Reproducibility of Single-Cell Gene Regulatory Network Inference Algorithms [paper]
- [2021 Nature Communications] A benchmark study of simulation methods for single-cell RNA sequencing data [paper]
- [2021 Genome Biology] Benchmarking UMI-based single-cell RNA-seq preprocessing workflows [paper]
- [2020 Nature Methods] Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data [paper]
- [2020 Genome Biology] A benchmark of batch-effect correction methods for single-cell RNA sequencing data [paper]
- [2020 Nature Biotechnology] A multicenter study benchmarking single-cell RNA sequencing technologies using reference samples [paper]
- [2019 Nature Methods] Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments [paper]
Metric Design
- [2019 Nature Methods] A test metric for assessing single-cell RNA-seq batch correction [paper]
Subcellular Analysis
- [2024 Nature Communications] BIDCell: Biologically-informed self-supervised learning for segmentation of subcellular spatial transcriptomics data [paper]
- [2023 Nature Methods] Spatiotemporally resolved transcriptomics reveals the subcellular RNA kinetic landscape [paper]
- [2023 biorxiv] Bering: joint cell segmentation and annotation for spatial transcriptomics with transferred graph embeddings [paper]
- [2023 Bioinformatics] FISHFactor: a probabilistic factor model for spatial transcriptomics data with subcellular resolution [paper]
- [2023 Science] Spatially resolved single-cell translatomics at molecular resolution [paper]
- [2023 Nature Methods] Subcellular omics: a new frontier pushing the limits of resolution, complexity and throughput [paper]
- [2022 BioRxiv] Bento: A toolkit for subcellular analysis of spatial transcriptomics data [paper]
- [2022 BioRxiv] Subcellular spatially resolved gene neighborhood networks in single cells [paper]
- [2022 bioRxiv] Statistical analysis supports pervasive RNA subcellular localization and alternative 3’ UTR regulation [paper]
- [2019 Cell] Atlas of Subcellular RNA Localization Revealed by APEX-Seq [paper]
Dimensionality Reduction and Visualization
- [2023 Genome Research] Complex hierarchical structures in single-cell genomics data unveiled by deep hyperbolic manifold learning [paper]
- [2021 Nature Communications] Deep generative model embedding of single-cell RNA-Seq profiles on hyperspheres and hyperbolic spaces [paper]
- [2018 Nature Communications] Interpretable dimensionality reduction of single cell transcriptome data with deep generative models [paper]
Representation Learning
- [2023 Nature Machine Intelligence] Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers [paper]
- [2023 Genome Biology] Correcting gradient-based interpretations of deep neural networks for genomics [paper]
- [2023 Nature Methods] SIMBA: single-cell embedding along with features [paper]
- [2023 bioRxiv] Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN [paper]
- [2021 Current Opinion in Systems Biology] Graph representation learning for single-cell biology [paper]
- [2020 Nature Communications] Realistic in silico generation and augmentation of single-cell RNA-seq data using generative adversarial networks [paper]
- [2019 Nature Methods] Data denoising with transfer learning in single-cell transcriptomics [paper]
- [2018 Nature Methods] Deep generative modeling for single-cell transcriptomics [paper]
Batch Effect Correction
- [2023 Bioinformatics] CLAIRE: contrastive learning-based batch correction framework for better balance between batch mixing and preservation of cellular heterogeneity [paper]
- [2020 Genome Biology] A benchmark of batch-effect correction methods for single-cell RNA sequencing data [paper]
- [2020 Nature Biotechnology] A multicenter study benchmarking single-cell RNA sequencing technologies using reference samples [paper]
- [2019 Nature Methods, Harmony] Fast, sensitive and accurate integration of single-cell data with Harmony [paper]
- [2018 Nature Biotechnology, CCA] Integrating single-cell transcriptomic data across different conditions, technologies, and species [paper]
- [2018 Nature Biotechnology, Mutual Nearest Neighbors] Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors [paper]
- [2018 Nature Methods] A test metric for assessing single-cell RNA-seq batch correction [paper]
- [2017 Nature Biotechnology] Multiplexed droplet single-cell RNA-sequencing using natural genetic variation [paper]
Tumor Microenvironment-TME
- [2023 bioRxiv] Identifying Spatial Co-occurrence in Healthy and InflAmed tissues (ISCHIA) [paper]
- [2023 bioRxiv] Predicting tumor immune microenvironment and checkpoint therapy response of head & neck cancer patients from blood immune single-cell transcriptomics [paper]
- [2022 Nature Biomedical Engineering] Graph deep learning for the characterization of tumour microenvironments from spatial protein profiles in tissue specimens [paper]
- [2022 Nature Communications] SOTIP is a versatile method for microenvironment modeling with spatial omics data [paper]
Cell-Cell Communication Events
- [2024 Nature Methods] Unsupervised and supervised discovery of tissue cellular neighborhoods from cell phenotypes [paper]
- [2024 Nature Reviews Genetics] The diversification of methods for studying cell–cell interactions and communication [paper]
- [2024 bioRxiv] Large-scale characterization of cell niches in spatial atlases using bio-inspired graph learning [paper]
- [2024 Pac Symp Biocomput] PEPSI: Polarity measurements from spatial proteomics imaging suggest immune cell engagement [paper]
- [2023 Cell Systems] Single-cell A/B testing for cell-cell communication [paper]
- [2023 Nature Biotechnology] Inferring cell–cell communication at single-cell resolution [paper]
- [2022 bioRxiv] scTensor detects many-to-many cell–cell interactions from single cell RNA-sequencing data [paper]
- [2022 Nature Biotechnology] Modeling intercellular communication in tissues using spatial graphs of cells [paper]
- [2022 bioRxiv] Decoding functional cell–cell communication events by multi-view graph learning on spatial transcriptomics [paper]
- [2021 Bioinformatics] Identifying signaling genes in spatial single-cell expression data [paper]
- [2020 Nature Methods] NicheNet: modeling intercellular communication by linking ligands to target genes [paper]
- [2020 Nature Communications] Predicting cell-to-cell communication networks using NATMI [paper]
- [2018 Nature] Single-cell reconstruction of the early maternal–fetal interface in humans [paper]
Gene Regulatory Network
- [2023 arxiv] DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets [paper]
- [2023 Bioinformatics] STGRNS: an interpretable transformer-based method for inferring gene regulatory networks from single-cell transcriptomic data [paper]
- [2022 Nature Machine Intelligence] Inferring transcription factor regulatory networks from single-cell ATAC-seq data based on graph neural networks [paper]
- [2022 Nature Biotechnology] Multi-omics single-cell data integration and regulatory inference with graph-linked embedding [paper]
- [2022 Biorxiv] scMEGA: Single-cell Multiomic Enhancer-based Gene Regulatory Network Inference [paper]
- [2022 Bioinformatics] High-performance single-cell gene regulatory network inference at scale: the Inferelator 3.0 [paper]
- [2022 Briefings in Bioinformatics] SIGNET: single-cell RNA-seq-based gene regulatory network prediction using multiple-layer perceptron bagging [paper]
- [2020 Nature Methods] Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data [paper]
- [2019 Genome Biology] Single-cell transcriptomics unveils gene regulatory network plasticity [paper]
- [2017 Cell Syst] Gene Regulatory Network Inference from Single-Cell Data Using Multivariate Information Measures [paper]
Imputation
- [2018 Nature Communications] An accurate and robust imputation method scImpute for single-cell RNA-seq data [paper]
- [2019 Genome Biology] DeepImpute: an accurate, fast, and scalable deep neural network method to impute single-cell RNA-seq data [paper]
- [2018 Cell] Recovering Gene Interactions from Single-Cell Data Using Data Diffusion [paper]
- [2018 Genome Biology] VIPER: variability-preserving imputation for accurate gene expression recovery in single-cell RNA sequencing studies [paper]
- [2021 PLOS Computational Biology] G2S3: A gene graph-based imputation method for single-cell RNA sequencing data [paper]
- [2021 Nature Communications] scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses [paper]
- [2021 iScience] Imputing single-cell RNA-seq data by combining graph convolution and autoencoder neural networks [paper]
- [2022 PLOS ONE] Single-cell specific and interpretable machine learning models for sparse scChIP-seq data imputation [paper]
Spatial Domain
- [2023 Nature Genetics] SPICEMIX enables integrative single-cell spatial modeling of cell identity [paper]
- [2023 bioRxiv] CellCharter: a scalable framework to chart and compare cell niches across multiple samples and spatial -omics technologies [preprint]
- [2022 Genome Research] A model-based constrained deep learning clustering approach for spatially resolved single-cell data [paper]
- [2022 Communications Biology] Deciphering tissue structure and function using spatial transcriptomics [Review paper]
- [2022 Genome Biology] Statistical and machine learning methods for spatially resolved transcriptomics data analysis [Review paper]
- [2022 Nature Communications] Deciphering spatial domains from spatially resolved transcriptomics with adaptive graph attention auto-encoder [paper]
- [2022 Nature Computational Science] Cell clustering for spatial transcriptomics data with graph neural networks [paper]
- [2022 Frontiers in Genetics] Analysis and Visualization of Spatial Transcriptomic Data [paper]
- [2021 Nature Methods] SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network [paper]
- [2021 Nature Biotechnology] Spatial transcriptomics at subspot resolution with BayesSpace [paper]
- [2021 Biorxiv] Unsupervised Spatially Embedded Deep Representation of Spatial Transcriptomics [paper]
- [2021 Genome Biology] Giotto: a toolbox for integrative analysis and visualization of spatial expression data [Tool]
- [2021 Biorxiv] Define and visualize pathological architectures of human tissues from spatially resolved transcriptomics using deep learning [paper]
- [2020 Biorxiv] stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues [paper]
- [2018 Nature Methods] SpatialDE: Identification of Spatially Variable Genes [paper]
- [2018 Nature Biotechnology] Identification of Spatially Associated Subpopulations by Combining scRNAseq and Sequential Fluorescence In Situ Hybridization Data [paper]
- [2008 Journal of Statistical Mechanics] Fast unfolding of community hierarchies in large networks [paper]
Reference Embedding or Transfer Learning
- [2019 Nature Methods] Data denoising with transfer learning in single-cell transcriptomics [paper]
- [2018 Nature Methods] Deep generative modeling for single-cell transcriptomics [paper]
- [2020 Bioinformatics] Conditional out-of-distribution generation for unpaired data using transfer VAE [paper]
- [2021 Nature Biotechnology] Mapping single-cell data to reference atlases by transfer learning [paper]
- [2021 Molecular Systems Biology] Probabilistic harmonization and annotation of single-cell transcriptomics data with deep generative models [paper]
- [2022 bioRxiv Preprint] Biologically informed deep learning to infer gene program activity in single cells [preprint]
Cell Segmentation
- [2023 biorxiv] Bering: joint cell segmentation and annotation for spatial transcriptomics with transferred graph embeddings [paper]
- [2022 Cytometry A] MIRIAM: A machine and deep learning single-cell segmentation and quantification pipeline for multi-dimensional tissue images [paper][code](MIRIAM)
- [2021 Biorxiv] Scellseg: a style-aware cell instance segmentation tool with pre-training and contrastive fine-tuning [paper] [code]
- [2021 Nature Biotechnology] Cell segmentation in imaging-based spatial transcriptomics [paper] [code](Baysor)
- [2021 Nature Biotechnology] Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning [paper] [code](Mesmer)
- [2021 Nature Methods] Cellpose: a generalist algorithm for cellular segmentation [paper] [code](Cellpose)
- [2021 Molecular Systems Biology] Joint cell segmentation and cell type annotation for spatial transcriptomics [paper] [code] (JSTA)
- [2020 Nature Communications] A convolutional neural network segments yeast microscopy images with high accuracy [paper] [code]
- [2020 Medical Image Analysis] DeepDistance: A multi-task deep regression model for cell detection in inverted microscopy images [paper] (DeepDistance)
- [2016 PLoS Computational Biology] Deep Learning Automates the Quantitative Analysis of Individual Cells in Live-Cell Imaging Experiments [paper] [code] (Deepcell)
Cell Type Deconvolution
- [2023 Genome Biology] Smoother: a unified and modular framework for incorporating structural dependency in spatial omics data [paper]
- [2023 BioRxiv] RETROFIT: Reference-free deconvolution of cell-type mixtures in spatial transcriptomics [paper]
- [2023 BioRxiv] STdGCN: accurate cell-type deconvolution using graph convolutional networks in spatial transcriptomic data [paper]
- [2023 BioRxiv] Spotless: a reproducible pipeline for benchmarking cell type deconvolution in spatial transcriptomics [paper]
- [2022 Nature Biotechnology] High-resolution alignment of single-cell and spatial transcriptomes with CytoSPACE [paper]
- [2022 Nature Communications] Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data [paper]
- [2022 Nature Biotechnology] DestVI identifies continuums of cell types in spatial transcriptomics data [paper]
- [2022 Biorxiv] Accurate cell type deconvolution in spatial transcriptomics using a batch effect-free strategy [paper]
- [2022 Nature Biotechnology] Spatially informed cell-type deconvolution for spatial transcriptomics [paper]
- [2022 Nature Cancer] Cell type and gene expression deconvolution with BayesPrism enables Bayesian integrative analysis across bulk and single-cell RNA sequencing in oncology [paper]
- [2022 Nature Communications] Advances in mixed cell deconvolution enable quantification of cell types in spatial transcriptomic data [paper]
- [2022 Nature Biotechnology] Cell2location maps fine-grained cell types in spatial transcriptomics [paper]
- [2021 Briefings in Bioinformatics] DSTG: deconvoluting spatial transcriptomics data through graph-based artificial intelligence [paper]
- [2021 Genome Research] Likelihood-based deconvolution of bulk gene expression data using single-cell references [paper]
- [2021 Genome Biology] SpatialDWLS: accurate deconvolution of spatial transcriptomic data [paper]
- [2021 Nucleic Acids Research] SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes [paper]
- [2021 Nature Biotechnology] Robust decomposition of cell type mixtures in spatial transcriptomics [paper]
- [2019 Nature Communications] Accurate estimation of cell-type composition from gene expression data [paper]
- [2019 Science] Slide-seq: A scalable technology for measuring genome-wide expression at high spatial resolution [paper]
Cell Type Annotation
- [2023 biorxiv] Scaling cross-tissue single-cell annotation models [paper]
- [2023 Nature Methods] Multi-layered maps of neuropil with segmentation-guided contrastive learning [paper]
- [2023 Nature Methods] Cue: a deep-learning framework for structural variant discovery and genotyping [paper]
- [2023 Nature Communications] Transformer for one stop interpretable cell type annotation [paper]
- [2023 Nature Biotechnology] TACCO unifies annotation transfer and decomposition of cell identities for single-cell and spatial omics [paper]
- [2022 Nature Methods] Annotation of spatially resolved single-cell data with STELLAR [paper]
NOTE: annotated reference cell graph + query cell graph
- [2022 Briefings in Bioinformatics] scIAE: an integrative autoencoder-based ensemble classification framework for single-cell RNA-seq data [paper]
- [2022 Nature Communications] scGCN is a graph convolutional networks algorithm for knowledge transfer in single cell omics [paper]
- [2022 Science] Cross-tissue immune cell analysis reveals tissue-specific features in humans [paper]
- [2022 Bioinformatics] CellMeSH: probabilistic cell-type identification using indexed literature [paper]
- [2022 Cancers] Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions [paper]
- [2021 Nucleic Acids Research] scDeepSort: a pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network [paper]
- [2021 BMC Bioinformatics] Single-cell classification using graph convolutional networks [paper]
- [2021 Genome Research] Semisupervised adversarial neural networks for single-cell classification [paper]
- [2020 BMC Bioinformatics] EnClaSC: a novel ensemble approach for accurate and robust cell-type classification of single-cell transcriptomes [paper]
- [2020 Bioinformatics] ACTINN: automated identification of cell types in single cell RNA sequencing [paper]
- [2020 Nature Communications] SciBet as a portable and fast single cell type identifier [paper]
- [2019 Nucleic Acids Research] SuperCT: a supervised-learning framework for enhanced characterization of single-cell transcriptomic profiles [paper]
- [2019 Nucleic Acids Research] CHETAH: a selective, hierarchical cell type identification method for single-cell RNA sequencing [paper]
- [2019 Bioinformatics] scMatch: a single-cell gene expression profile annotation tool using reference datasets [paper]
- [2019 Cell Systems] SingleCellNet: A Computational Tool to Classify Single Cell RNA-Seq Data Across Platforms and Across Species [paper]
- [2019 Genome Biology] scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data [paper]
Cell Clustering
- [2023 Bioinformatics] scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering [paper]
- [2023 bioRxiv] G3DC: a Gene-Graph-Guided selective Deep Clustering method for single cell RNA-seq data [paper]
- [2022 BMC Bioinformatics] SC3s: efficient scaling of single cell consensus clustering to millions of cells [paper]
- [2022 Bioinformatics] GNN-based embedding for clustering scRNA-seq data [paper]
- [2022 AAAI] ZINB-based Graph Embedding Autoencoder for Single-cell RNA-seq Interpretations [paper]
- [2022 Briefings in Bioinformatics] Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network [paper]
- [2022 Bioinformatics] scGAC: a graph attentional architecture for clustering single-cell RNA-seq data [paper]
- [2022 Nature Computational Science] Cell clustering for spatial transcriptomics data with graph neural networks [paper]
- [2021 Nature Communications] Model-based deep embedding for constrained clustering analysis of single cell RNA-seq data [paper]
- [2020 NAR Genomics and Bioinformatics] Deep soft K-means clustering with self-training for single-cell RNA sequence data [paper]
- [2020 Nature Communications] Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis [paper][website][github]
- [2019 Nature Machine Intelligence] Clustering single-cell RNA-seq data with a model-based deep learning approach [paper]
Disease Prediction
- [2024 Nature Biotechnology] Can single-cell biology realize the promise of precision medicine? [paper]
- [2018 IJCAI] Hybrid Approach of Relation Network and Localized Graph Convolutional Filtering for Breast Cancer Subtype Classification [paper]
- [2021 NPJ Digital Medicine] DeePaN - A deep patient graph convolutional network integrating clinico-genomic evidence to stratify lung cancers benefiting from immunotherapy [paper]
- [2022 Biocomputing] CloudPred: Predicting Patient Phenotypes From Single-cell RNA-seq [paper]
- [2020 CHIL '20: Proceedings of the ACM Conference on Health, Inference, and Learning] Disease state prediction from single-cell data using graph attention networks [paper]
Multimodal Integration
- [2024 Nature Methods] Search and match across spatial omics samples at single-cell resolution [Paper]
- [2023 Nature Biotechnology] Integration of spatial and single-cell data across modalities with weakly linked features [Paper]
- [2023 Nature Communications] scDREAMER for atlas-level integration of single-cell datasets using deep generative model paired with adversarial classifier [Paper]
- [2023 biorxiv] Automated single-cell omics end-to-end framework with data-driven batch inference [Paper]
- [2023 Nature Biotechnology] Integration of multi-modal single-cell data [Paper]
- [2023 Briefings in Bioinformatics] A universal framework for single-cell multi-omics data integration with graph convolutional networks [Paper]
- [2022 PMLR] CVQVAE: A representation learning based method for multi-omics single cell data integration [Paper]
- [2022 Nature Biotechnology] Multi-omics single-cell data integration and regulatory inference with graph-linked embedding [Paper]
- [2022 Nature Communications] Clustering of single-cell multi-omics data with a multimodal deep learning method [Paper]
- [2022 Genome Biology] A benchmark study of deep learning-based multi-omics data fusion methods for cancer [Survey]
- [2018 ICML] MAGAN: Aligning biological manifolds [paper]
- [2019 PLoS Computational Biology] Building gene regulatory networks from scATAC-seq and scRNA-seq using linked self organizing maps [paper]
- [2020 Bioinformatics] SCIM: universal single-cell matching with unpaired feature sets [paper]
- [2021 Nature Communications] Multi-domain translation between single-cell imaging and sequencing data using autoencoders [paper]
- [2021 PLoS Computational Biology] Imputation of spatially-resolved transcriptomes by graph-regularized tensor completion [paper]
- [2021 Genome Biology] Cobolt: integrative analysis of multimodal single-cell sequencing data [paper]
- [2021 Cell Reports Methods] A mixture-of-experts deep generative model for integrated analysis of single-cell multiomics data [paper]
- [2021 Briefings in Bioinformatics] Deep-joint-learning analysis model of single cell transcriptome and open chromatin accessibility data [paper]
- [2021 Bioinformatics] Deep cross-omics cycle attention model for joint analysis of single-cell multi-omics data [paper]
- [2022 Nature Biotechnology] scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning [paper]
- [2022 Bioinformatics] SMILE: mutual information learning for integration of single-cell omics data [paper]
- [2022 SIGKDD] Graph Neural Networks for Multimodal Single-Cell Data Integration [paper]
- [2022 Genome Biology] scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously [paper]
- [2019 Biorxiv] A Joint Model of RNA Expression and Surface Protein Abundance in Single Cells [paper]
- [2021 Biorxiv] DeepMAPS: Single-cell biological network inference using heterogeneous graph transformer [paper]
- [2022 Biorxiv] Adaptative Machine Translation between paired Single-Cell Multi-Omics Data [paper]
- [2022 Biorxiv] Multigrate: single-cell multi-omic data integration [paper]
- [2019 NeurIPS multi-lingual pretraining for multi-omics] Cross-lingual Language Model Pretraining [Paper]
Multiomics Translation
- [2024 Nature Communications] scButterfly: a versatile single-cell cross-modality translation method via dual-aligned variational autoencoders [paper]
- [2023 arxiv scHyena] scHyena: Foundation Model for Full-Length Single-Cell RNA-Seq Analysis in Brain [paper]
- [2023 bioRxiv scTranslator] A pre-trained large language model for translating single-cell transcriptome to proteome [paper]
- [2023 Advanced Science] Efficient Generation of Paired Single-Cell Multiomics Profiles by Deep Learning [paper]
- [2022 JCB] Multimodal Single-Cell Translation and Alignment with Semi-Supervised Learning [paper]
- [2022 Nature Machine Intelligence sciPENN] A multi-use deep learning method for CITE-seq and single-cell RNA-seq data integration with cell surface protein prediction and imputation [paper]
- [2022 RECOMB] Semi-supervised Single-Cell Cross-modality Translation Using Polarbear [paper]
- [2020 PNAS] BABEL enables cross-modality translation between multiomic profiles at single-cell resolution [paper]