JinmiaoChenLab / Batch-effect-removal-benchmarking

A benchmark of batch-effect correction methods for single-cell RNA sequencing data
70 stars 45 forks source link

A question about input file of kBET #4

Open yuGithuuub opened 4 years ago

yuGithuuub commented 4 years ago

Hello : Thanks for your wonderful job! I have a question about the input file of kBET algorithm. I noticed that the input file of kBET is the PCA embedding matrix of intergrated object , instead of the cell_feature matrix. So, I tested the following 3 input files.

  1. cell_feature matrix of integrated data seurat_V3_直接用细胞.png.pdf

  2. PCA embedding matrix of intergrated data . seurat_V3_intergrated_PCA.pdf

  3. PCA embedding matrix of Raw data serat_v3_sct.pdf

It looks better to use PCA embedding as the input file. Why is this?

nhuhoa commented 2 years ago

Hi,

First of all, we did the benchmark of various batch effects methods, and some methods are applied in raw data genes x cells space, while some other methods only work in feature reduction spaces, so we decided to use PCA embedding matrix as input. Cells in PCA feature reduction spaces are less noisy compared to cells in gene expression matrix space - suppose that PCs can capture almost the main cells characteristics, so it is easy to understand why the output of statistical test is better in reduction space.

Hoa