Closed pechersky closed 7 years ago
@maxhodak One thing I forgot to mention, this change also makes the preprocessed h5 file ~2X smaller, since this stores the tensor as a "float4" type instead of an "integer8" type, at least on my machine. Don't know if this is an undesired side effect.
The preprocessed file is a huge sparse binary matrix, so it seems like this shouldn't make a difference. There are definitely optimizations to do here but it hasn't been broken enough for me to bother with yet.
I don't think the float vs int encoding should make a difference here but I should probably investigate a little bit.
Mentioned in #18