BIMSBbioinfo / janggu

Deep learning infrastructure for genomics
GNU General Public License v3.0
254 stars 33 forks source link

Support for sparse datasets #1

Closed wkopp closed 6 years ago

wkopp commented 6 years ago

Currently datasets can be stored as numpy arrays in memory or as hdf5 file on disk. As Genomics is full of sparse datasets, it makes sense to introduce a storage option for sparse data.

To this end, a SparseGenomicArray needs to be implemented that utilizes scipy.sparse matrices.

wkopp commented 6 years ago

This has been implemented.