satijalab / seurat

R toolkit for single cell genomics
http://www.satijalab.org/seurat
Other
2.18k stars 891 forks source link

Differences in the re-normalization of a public Seurat Object #9054

Closed evaesquinas closed 1 week ago

evaesquinas commented 1 week ago

I have downloaded a public seurat object which is already normalized. Due I want to focus on one particular subset of cells and I would like to re-cluster them, I will have to normalize. However, I realised that if I normalize the object again (the complete one that I downloaded), the normalization is not the same.

seu_downloaded <- Seurat::NormalizeData(seu_downloaded,
                     normalization.method = "LogNormalize",
                     scale.factor = 10000, assay = assays[["RNA"]]@counts)

Since I would like to focus on one subset of cells and therefore, re-cluster cells again (to study the group in deep) and do everything that is related with the process (normalization, scale, etc again), I am worried to have inconsistency in the normalization of the full seu object that I have downloaded and my subsetted/re-clustered object (done by me).

Is it recommended to normalize again all the downloaded Seurat objects from public data regardless if they are already normalized?

Any feedback will be really appreciated.

Thanks very much in advance

Gesmira commented 1 week ago

Ideally, you would normalize your objects in the same way. If your object already contains raw counts in the counts layer (in Seurat v5, you would access these like so: seu_download[["RNA"]]$counts), then NormalizeData will by default create a new data layer by normalizing the counts layer. You don't always have to normalize public data again, but in this case since you don't know how the data was originally normalized, I would recommend doing so.