theislab / kBET

An R package to test for batch effects in high-dimensional single-cell RNA sequencing data.
Apache License 2.0

weird(opposite) result #42

Closed james20141606 closed 5 years ago

james20141606 commented 5 years ago

Hi, I enjoy your tool very much! However, I ran into a strange problem with a small dataset: kBET does not seem very stable when quantifying the batch effect. The score can be exactly 0 or 1, and it can be the complete opposite of what intuition and visualization suggest. Does kBET require a large sample size to work?

mbuttner commented 5 years ago

Hi, kBET is a stochastic approach, so the results vary somewhat between runs, but I never saw scores jump from 0 to 1 and back in my tests. However, kBET is not suited to 'small' datasets (where small means fewer than ~20 samples per batch). In this regime, the test statistic is not reliable. I do not know how small your dataset is, so I can't be more specific.

Alternatively, you may consider silhouette and PCregression as scores for your batch effect.
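
To make the silhouette suggestion concrete, here is a minimal sketch of a batch silhouette score: the mean silhouette width when the batch labels are treated as the clusters. It is not kBET's own implementation (kBET is an R package); the function name `batch_silhouette`, the Euclidean distance, and the toy coordinates are all assumptions made for illustration. In practice the points would typically be cells in a reduced space such as the first few principal components.

```python
import math
from collections import defaultdict


def batch_silhouette(points, batches):
    """Mean silhouette width with batch labels used as the clusters.

    Values near 1 suggest batches are well separated (strong batch effect);
    values near 0 or below suggest batches overlap (well mixed).
    Hypothetical helper for illustration, not part of kBET.
    """
    groups = defaultdict(list)
    for idx, label in enumerate(batches):
        groups[label].append(idx)
    if len(groups) < 2:
        raise ValueError("need at least two batches")

    widths = []
    for i, p in enumerate(points):
        own = groups[batches[i]]
        if len(own) == 1:
            widths.append(0.0)  # common convention for singleton clusters
            continue
        # a: mean distance to the other members of the same batch
        a = sum(math.dist(p, points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other batch
        b = min(
            sum(math.dist(p, points[j]) for j in members) / len(members)
            for label, members in groups.items()
            if label != batches[i]
        )
        denom = max(a, b)
        widths.append(0.0 if denom == 0 else (b - a) / denom)
    return sum(widths) / len(widths)


# Toy data (made up): four points forming two tight pairs far apart.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
separated = batch_silhouette(points, ["A", "A", "B", "B"])  # batches match the pairs
mixed = batch_silhouette(points, ["A", "B", "A", "B"])      # batches split across pairs
```

With these toy points, `separated` comes out much higher than `mixed`, which matches the reading of the score: a high value means the batches occupy distinct regions. With only a handful of samples per batch the mean is still noisy, so it is best read alongside a visualization.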