Shiyc-Lab commented 1 year ago

When running NormalizeData() on the same data, v4 finishes quickly, but v5 keeps running and never stops (at least 10 hours).

R version 4.3.1 (2023-06-16) Platform: x86_64-conda-linux-gnu (64-bit) Running under: Ubuntu 20.04.1 LTS

Matrix products: default BLAS/LAPACK: /data1/users/zhoux1/.conda/envs/r4/lib/; LAPACK version 3.11.0

locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C

time zone: Asia/Shanghai tzcode source: system (glibc)

R version 4.3.1 (2023-06-16) Platform: x86_64-conda-linux-gnu (64-bit) Running under: Ubuntu 20.04.1 LTS

Matrix products: default BLAS/LAPACK: /data1/users/zhoux1/.conda/envs/r-seurat5/lib/; LAPACK version 3.11.0

locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C

time zone: Asia/Shanghai tzcode source: system (glibc)

Shiyc-Lab commented 1 year ago

The size of the data is 500000 x 20000.

Gesmira commented 1 year ago

Hi, a user previously reported a similar issue here. Can you confirm the type of your counts matrices? What does class(obj[["RNA"]]$counts) return? The fix for them was to convert their counts matrices to dgCMatrix objects by running the following for each layer: obj[["RNA"]]$counts <- as(obj[["RNA"]]$counts, "CsparseMatrix")
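
For anyone who wants to see what that check and conversion do before touching a real object, here is a minimal sketch using only the Matrix package. The toy matrix `counts` and the name `counts_csc` are made up for illustration; they stand in for a counts layer that is sparse but not stored in column-compressed form. This only illustrates the class check and the `as(..., "CsparseMatrix")` step, and is not a claim about the cause of the slowdown reported in this issue.

```r
library(Matrix)

# Hypothetical toy counts matrix in triplet storage (dgTMatrix), standing in
# for a counts layer that is sparse but not column-compressed.
counts <- sparseMatrix(
  i = c(1, 3, 2), j = c(1, 1, 2), x = c(5, 2, 7),
  dims = c(3, 2), repr = "T"
)

# Step 1: inspect the storage class (the same check as class(obj[["RNA"]]$counts)).
print(class(counts))

# Step 2: convert to column-compressed storage.
counts_csc <- as(counts, "CsparseMatrix")
print(class(counts_csc))

# The values are unchanged; only the storage format differs.
stopifnot(identical(as.matrix(counts), as.matrix(counts_csc)))
```

On a Seurat object, the same two steps are the class(obj[["RNA"]]$counts) check and the as(..., "CsparseMatrix") assignment quoted above, repeated for each counts layer.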

Shiyc-Lab commented 1 year ago

Well, the type is actually dgCMatrix.

I downloaded the raw counts matrices and created the object using CreateSeuratObject().

