Closed GeoffNN closed 3 years ago
I'm not sure how to do this without making the code much slower for sparse matrices. Currently the SAGA/SVRG code is very efficient for sparse matrices using the fact that the partial gradients are sparse. Using minimatches would destroy the sparsity in the gradients, making the code much slower in this regime
Both methods use
batch size=1
.