Currently datasets can be stored as numpy arrays in memory or as hdf5 file on disk.
As Genomics is full of sparse datasets, it makes sense to introduce a storage option for sparse data.
To this end, a SparseGenomicArray needs to be implemented that utilizes scipy.sparse matrices.
Currently datasets can be stored as numpy arrays in memory or as hdf5 file on disk. As Genomics is full of sparse datasets, it makes sense to introduce a storage option for sparse data.
To this end, a SparseGenomicArray needs to be implemented that utilizes scipy.sparse matrices.