Scalability on large datasets

Tang-Lab-super / PROST

PROST: A quantitative pattern recognition framework for spatial transcriptomics.

MIT License

5 stars 1 forks source link

Hi @Sicrve11 , thanks for your contribution to PROST! Now I've successfully tested PROST on my data. But I'm stuck on analyzing large dataset that contains 50000 cells and 36 genes. Everything went smoothly until PROST.spatial_autocorrelation(adata, k = 10, permutations = None). It threw me an error which said,

Error: MemoryError: Unable to allocate 14.9 GiB for an array with shape (50000 , 50000) and data type float64

I'm working on a Windows10 machine with 64GB RAM. Below is my session info. Do you have any ideas to get the hypothesis statistics for large dataset?

Tang-Lab-super / PROST

Scalability on large datasets #4