Closed jzhoubu closed 4 months ago
We have decided to continue using the scipy .npz
format for storing sparse indices. While we utilize torch sparse tensors for inner product operations, storing sparse indices in this format presents two significant issues:
.npz
format, likely due to the use of int64 for indices and float32 for values.
Goals
To streamline the index managemen by unifying the index types (to torch csr tensor) and the save & load functions.
Trade-off
The current
.npz
format fromscipy
is advantageous for its compression with minimal disk usage. Transition from scipy.npz
format to the pytorch.pt
comes with larger disk occupation and faster loading speed.We will provide an option to choose between the
.npz
and.pt
formats.