CahanLab / stemfinder

estimate cell fate potency from single cell RNA-seq data
0 stars 0 forks source link

Feasibility of Running on Separate Samples/Cell Groups #1

Closed aldas79 closed 4 hours ago

aldas79 commented 2 weeks ago

I am working with scRNA-seq data that consists of various cell groups from a total of 9 samples. The dataset contains over 1 million cells. When I run StemFinder on the entire dataset, the calculation time seems to be very long. Based on my experience, the calculation time of CCAT and Cytotrace increases linearly with the number of cells. These tools seems possible to run these algorithms in parallel on separate samples or cell groups. However, this does not appear to be the case with StemFinder due to its algorithm characteristics. I understand that StemFinder utilizes the heterogeneity in cell cycle gene expression to estimate the potency of individual cells. It calculates the stemness of each cell based on its neighboring cells using a KNN algorithm. My questions are: 1. Is it possible to run StemFinder separately for each person or for a specific cell group? 2. If so, would it be possible to combine the results obtained from separate runs for comparison? If there are any guidelines or recommendations for dividing the dataset and comparing the results, I would greatly appreciate it if you could share them.

kma2112 commented 1 week ago

Thank you for your comment,

  1. Yes, stemFinder can be run on a subset of the original dataset.
  2. To run on a large dataset with many cells, we recommend the following: