mojaveazure / seurat-disk

Interfaces for HDF5-based Single Cell File Formats
https://mojaveazure.github.io/seurat-disk
GNU General Public License v3.0
146 stars 48 forks source link

File size differences of AnnData .h5ad #9

Open alikhuseynov opened 4 years ago

alikhuseynov commented 4 years ago

Dear Paul, I found SeuratDisk very useful, thank you for implementing it! I converted .h5ad file (~23G) to h5seurat(~28G) to work in Seurat. Then I saved it as h5seurat (becomes ~5G) and converted to .h5ad(~3.7G) and loaded it to Scanpy. questions:

LustigePerson commented 4 years ago

Have you checked the compressionparameter from anndata.write_h5ad()? It's turned off by default. Maybe seurat is compressing by default?

alikhuseynov commented 4 years ago

I don't use Scanpy much.. it might be that SeuratDisk compresses it. Thanks