I am working with scRNA-seq data that consists of various cell groups from a total of 9 samples. The dataset contains over 1 million cells. When I run StemFinder on the entire dataset, the calculation time seems to be very long.
Based on my experience, the calculation time of CCAT and Cytotrace increases linearly with the number of cells. These tools seems possible to run these algorithms in parallel on separate samples or cell groups. However, this does not appear to be the case with StemFinder due to its algorithm characteristics.
I understand that StemFinder utilizes the heterogeneity in cell cycle gene expression to estimate the potency of individual cells. It calculates the stemness of each cell based on its neighboring cells using a KNN algorithm.
My questions are:
1.
Is it possible to run StemFinder separately for each person or for a specific cell group?
2.
If so, would it be possible to combine the results obtained from separate runs for comparison?
If there are any guidelines or recommendations for dividing the dataset and comparing the results, I would greatly appreciate it if you could share them.
I am working with scRNA-seq data that consists of various cell groups from a total of 9 samples. The dataset contains over 1 million cells. When I run StemFinder on the entire dataset, the calculation time seems to be very long. Based on my experience, the calculation time of CCAT and Cytotrace increases linearly with the number of cells. These tools seems possible to run these algorithms in parallel on separate samples or cell groups. However, this does not appear to be the case with StemFinder due to its algorithm characteristics. I understand that StemFinder utilizes the heterogeneity in cell cycle gene expression to estimate the potency of individual cells. It calculates the stemness of each cell based on its neighboring cells using a KNN algorithm. My questions are: 1. Is it possible to run StemFinder separately for each person or for a specific cell group? 2. If so, would it be possible to combine the results obtained from separate runs for comparison? If there are any guidelines or recommendations for dividing the dataset and comparing the results, I would greatly appreciate it if you could share them.