"celda" stands for "CEllular Latent Dirichlet Allocation", which is a suite of Bayesian hierarchical models and supporting functions to perform gene and cell clustering for count data generated by single cell RNA-seq platforms. This algorithm is an extension of the Latent Dirichlet Allocation (LDA) topic modeling framework that has been popular in text mining applications. Celda has advantages over other clustering frameworks:
To install the most recent beta release of celda via devtools:
library(devtools)
install_github("compbiomed/celda@v0.5")
The most up-to-date (but potentially less stable) version of celda can similarly be installed with:
install_github("compbiomed/celda")
NOTE On OSX, devtools::install_github() requires installation of libgit2. This can be installed via homebrew:
brew install libgit2
Vignettes are available in the package.
An analysis example using celda with RNASeq via vignette('celda-analysis')
The v0.4 release of celda represents a useable implementation of the various celda clustering models. Please submit any usability issues or bugs to the issue tracker at https://github.com/compbiomed/celda
You can discuss celda, or ask the developers usage questions, in our Google Group.