Closed BiotechPedro closed 9 months ago
Hi Pedro, Thank you very much for your interest in our work! Your two questions are crucial. I will provide my thoughts.
Best, Dongyuan
Thank you, Dongyuan!
1) I agree that we look for randomness rather than stability. However, the aggregation from several synthetic null data to get more stable DE results would be better. In which terms have you thought about it?
2) So, you would remove the cell cycle-related genes rather than regressing them out in both the original and the synthetic data, right?
Best,
Pedro
Hi Pedro,
Best, Dongyuan
I suppose that ClusterDE has a low frequency of type I errors, so the results are almost equal when adjusting by either one or multiple null scores, right?
Thank you very much for all your insights and congrats again for the method!
Best,
Pedro
Hi Dongyuan :D
Congrats for this clever solution regarding the double dipping issue!
I've read that the case scenario for ClusterDE to be applied is for '1vs1' differential expression rather than '1vsALL', which makes total sense to me. However, while running the same analytic pipeline to the null dataset for creating two artificial clusters, they will be sensitive to randomness. In other words, since the two clusters are gonna be artificial, cells in the edges could belong to cluster A with one seed and to cluster B with other seed. Have you though in implementing kind of cluster stability measures for creating stable null cluster? Would it make sense to repeat the process few times and then discard the "volatile" cells to create purer clusters?
Another question I have is how to deal with cell cycle effect. If I am comparing two original clusters obtained after regressing out the effect of the cell cycle, should I regress out that effect when running the pipeline on the null synthetic null data?
Best regards,
Pedro