koheiw / proxyC

R package for large-scale similarity/distance computation
GNU General Public License v3.0
29 stars 6 forks source link

Dev zapsmall #52

Closed koheiw closed 4 months ago

koheiw commented 4 months ago

I notice that zapsmall() takes very long time when the similarity matrix is large and dense, so moved this to C++. This and other changes in this branch makes handling of denser similarity matrices a lot more efficient.

options("proxyC.threads" = 12)
mt <- Matrix::rsparsematrix(50000, 10000, 0.001)
profvis::profvis(
    out <- proxyC::simil(mt, use_nan = FALSE)
)

Master

image

This branch

image

koheiw commented 4 months ago

@rcannood can you review?