kwikteam / klustakwik2

Fast software for high-dimensional cluster analysis using the masked EM algorithm for Gaussians mixtures
BSD 3-Clause "New" or "Revised" License
27 stars 13 forks source link

Try using a large number of mask starts #36

Open thesamovar opened 9 years ago

thesamovar commented 9 years ago

Once the code has been modified so that M and E steps are merged and covariance matrices don't have to be stored, we can use much larger numbers of clusters. In particular, we could consider one initial cluster for each unique binary mask (or maybe we should reduce them a little bit). This could potentially solve a lot of the problems with big cluster masks.