Quesetion about principle of deepcluster

YoungJaeChoung commented 5 years ago

I tried to use deepcluster on non-image and non-linear dataset, i.e. spiral data etc.

When I do pseudo class methods, it`s not obvious to me that how pseudo-class (which represents clustering results of k-means whose objective is a convex form) could be used to separate nonlinear data.

Can I receive some hint for this question...?

mathildecaron31 commented 5 years ago

Hi, It is not clear to me what you mean by 'non-linear dataset'; I guess you mean non linearly separable. Well in this case, we are in an unsupervised setting, so we don't really have access to labels. Thus, the notion of linearly separable is blurry in this setting. You'll always find pseudo-labels that make the problem linearly separable (each data represents its own class for example).

Pseudo-classes assignment is performed in the feature space (using the activations of the penultimate layer of the network) and not in the raw input space. In this space, the data are more likely to be separable. As you say, k-means clustering produces an assignment of the data into clusters that are linearly separable (Voronoi diagram). The linear classifier (ie last layer of the network) then has to discriminate between different linearly separable pseudo-classes.

Hope it helps

YoungJaeChoung commented 5 years ago

@mathildecaron31

Thank you for your answer. I mean non linearly separable data as you guessed.

My curiosity has been resolved :-)

facebookresearch / deepcluster

Quesetion about principle of deepcluster #21