Implementation Speed - Githubissues

@ayushgupt thanks for opening the issue. The approach is a nearest neighbor approach and thus inherently slow, especially with a large number of observations. With 37k observations, you can definitely expect it to take a while to return scores. While the implementation could probably be improved in regards to speed, the LoOP approach is computationally expensive with a large number of observations.

I'm not sure what you're data looks like, but one option is to use Hamlet et. al.'s modified implementation of LoOP that is included in this package. Their approach allows one to fit LoOP on "training" data, and then score incoming observations against the original fit. It's not as accurate as fitting LoOP outright to all data, but should help in regards to speed if that is what you're looking for.

Hope this helps. Feel free to comment further, but I'll be closing the issue as the issue you mentioned above is an inherent trait of the algorithm and approach and not of the implementation.

vc1492a / PyNomaly

Implementation Speed #11