digitalcytometry / cytospace

CytoSPACE: Optimal mapping of scRNA-seq data to spatial transcriptomics data
Other
112 stars 19 forks source link

Purpose of downsampling #100

Closed d-paliwal closed 4 months ago

d-paliwal commented 5 months ago

Hi! Thank you so much for your tool! I am wondering if you could please clarify what the purpose of downsampling to 1500 transcripts is. Thanks in advance!

erinlbrown commented 5 months ago

Hi, thank you for your interest in CytoSPACE! Downsampling transcripts serves as a form of normalization and ensures that the cell-to-spot assignments will be independent of the total transcript count per cell. Hope that helps clarify!

d-paliwal commented 5 months ago

Hi Erin! Thank you for your response. I see. Is this meant to serve as a batch effect correction method in the case that the single cell atlas is compiled from multiple batches, or do you recommend additional batch effect correction be performed in addition to the transcript downsampling?

erinlbrown commented 5 months ago

While the downsampling component can address batch effects, its inclusion within the method (also) targets biological variation in nonzero gene counts per cell, often related to cell type/state, so that in assessing the match between single cell and spatial profiles, certain cell types are not inherently privileged due to higher or lower sparsity in the scRNA-seq. This is particularly relevant for bulk spatial profiles such as Visium, for which the sparsity of the spatial RNA-seq profile may be more representative of the total number or heterogeneity of cells per spot, rather than the transcriptomic sparsity of individual cells. In general, we have found CytoSPACE to be highly robust to noise and moderate variation in inputs, and no other batch correction is generally recommended. Hope this helps clarify!

d-paliwal commented 5 months ago

I see. Thank you! That helps.