satijalab / sctransform

R package for modeling single cell UMI expression data using regularized negative binomial regression
GNU General Public License v3.0
208 stars 33 forks source link

Selection of the number of cells used for the NB regression in SCTransform #118

Open BiotechPedro opened 2 years ago

BiotechPedro commented 2 years ago

Hello everyone 😄

First of all, thank you for the great and easy Seurat framework in general!

By default ncells = 5000 in SCTransform(), but I am asking myself what is the more accurate number of cells for each dataset. For example, I am blindy using the 40% of the cells to build the regression. Do you have some estimates for the best percentage, or it similarly works for a broad range of ncells?

Thank you!