Hello there ,I used the package to detect the batch effect of the data,for example
First, I used the Seurat to integrate the raw matrix, then fitered cell and caculated the top 2000 VariableFeatures. I used the matrix of 2000 genes as the data to search the batch effect with the kBET, here are my results:
lung1-liver1:
lung1-lung3
Reasonable, the batch effect within different tissues is higher than within the same tissues.
So,I want to know that why the rejection rates between two different tissues (lung1 vs liver1) lower than the same tissues (lung1 vs lung3).
I am looking forward to your reply. Thank you.
that's an interesting question and I agree on your reasoning. I have a couple of suggestions and questions:
Did you check the batch effects before correction?
Lung-Liver: Can you rule out overcorrection? I am wondering if lung and liver have enough cell types in common to ensure a reasonable data integration. If not, Seurat's data integration method would integrate completely distinct cell types.
Lung1-Lung3: Do you observe several cell types and, if yes, shifts in the cell type composition? In that case I advise to run kBET per cluster (or cell type) and average over all kBET results to account for frequency shifts - I wrote an example in the README, section 'Subsampling'.
Hello there ,I used the package to detect the batch effect of the data,for example
First, I used the Seurat to integrate the raw matrix, then fitered cell and caculated the top 2000 VariableFeatures. I used the matrix of 2000 genes as the data to search the batch effect with the kBET, here are my results:
lung1-liver1:
lung1-lung3
Reasonable, the batch effect within different tissues is higher than within the same tissues. So,I want to know that why the rejection rates between two different tissues (lung1 vs liver1) lower than the same tissues (lung1 vs lung3). I am looking forward to your reply. Thank you.