running out of memory - Githubissues

ChangSuBiostats / CS-CORE_python

Python package for CS-CORE, a statistical method for cell-type-specific co-expression inference from single cell RNA-sequencing data

MIT License

Hi Daniel,

Thanks for your interest in our work, and thank you for bringing this issue to our attention!

This is likely because our previous implementation expected the count matrix as a numpy array and performed matrix multiplication with numpy functions. However, np.dot behaves unexpectedly when the count matrix is a scipy csr_matrix: instead of computing a single matrix product, it generates a large number of p*p matrices, which may explain the memory issue.
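To illustrate the general point, here is a minimal sketch (not the CS-CORE code itself) of sparse-aware multiplication on a csr_matrix. The matrix sizes, the Poisson rate, and all variable names are made up for the example. The idea is simply to use the `@` operator, which dispatches to scipy's sparse routines, rather than passing a sparse matrix to np.dot.

```python
import numpy as np
from scipy import sparse

# Hypothetical toy data: 200 cells x 30 genes of sparse Poisson counts.
rng = np.random.default_rng(0)
n_cells, n_genes = 200, 30
dense_counts = rng.poisson(0.3, size=(n_cells, n_genes)).astype(np.float64)
sparse_counts = sparse.csr_matrix(dense_counts)

# Sparse-aware product: `@` uses scipy's sparse multiplication and
# returns a single sparse n_genes x n_genes matrix.
gram_sparse = sparse_counts.T @ sparse_counts

# Reference result computed on the dense copy with plain numpy.
gram_dense = dense_counts.T @ dense_counts

# Sparse matrix times a dense vector gives an ordinary 1-D numpy array.
weights = rng.random(n_cells)
weighted_sums = sparse_counts.T @ weights

print(gram_sparse.shape, weighted_sums.shape)
print(np.allclose(gram_sparse.toarray(), gram_dense))
```

If a numpy-only code path is unavoidable, converting with `.toarray()` first also works, at the cost of holding the full dense matrix in memory.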

We have updated the implementation to take AnnData / csr_matrix as input. You can follow this notebook for an example, or see CSCORE_IRLS.py for the actual implementation. We have also benchmarked the time and memory usage of the new implementation in this notebook. Its speed should be comparable to that of the R version.

I hope this helps! Feel free to leave a comment if you have more questions.

Best, Chang

running out of memory #2