I have a dataset which I have split into multiple sub-groupings of samples to run CellphoneDB on. When I run it on each subset, for 2 of the 3 subsets I get the same number of overall interactions in the means.txt output, but for one of the subsets I get less interactions. Some of these interactions that are missing from the one subset are particularly interesting to us and initially I thought the problem was with the threshold because the genes in these interactions are expressed on a lower number of cells in this subset, but after trying this it did not make a difference. Additionally, I have checked and, although some of the clusters in this subset have a very small number of cells, the percentage of cells expressing the genes for the missing interactions are actually quite high in some clusters. Is there a reason why these interactions are not being included in one subset from this dataset but are included in runs for the other 2 subsets?
Hi,
I have a dataset which I have split into multiple sub-groupings of samples to run CellphoneDB on. When I run it on each subset, for 2 of the 3 subsets I get the same number of overall interactions in the means.txt output, but for one of the subsets I get less interactions. Some of these interactions that are missing from the one subset are particularly interesting to us and initially I thought the problem was with the threshold because the genes in these interactions are expressed on a lower number of cells in this subset, but after trying this it did not make a difference. Additionally, I have checked and, although some of the clusters in this subset have a very small number of cells, the percentage of cells expressing the genes for the missing interactions are actually quite high in some clusters. Is there a reason why these interactions are not being included in one subset from this dataset but are included in runs for the other 2 subsets?
Thanks