helmholtz-analytics / heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
https://heat.readthedocs.io/
MIT License
212 stars 53 forks source link

Implement Nystrom method for better scaling behaviour #640

Open Cdebus opened 4 years ago

Cdebus commented 4 years ago

Current Spectral Clsutering algorihm is based on full distance matrix calculation and Lanczos algorithm for Eigenvalue decomposition. Whilst this provides accurate result at acceptable speed, the Nystrom method via approximating the distance matrix might yield better scaling behavious

ClaudiaComito commented 2 years ago

@Cdebus @Markus-Goetz is this what you're referring to?

https://scikit-learn.org/stable/modules/generated/sklearn.kernel_approximation.Nystroem.html

github-actions[bot] commented 1 year ago

Branch 640-Implement_Nystrom_method_for_better_scaling_behaviour created!

mrfh92 commented 1 year ago

Implementing this idea and trying out how much it improves performance would be an interesting idea for a small/mid-size student work; although not urgent because a routine for spectral clustering already exists.

Reviewed within #1109