Closed daniiki closed 3 years ago
Hi @daniiki,
Have you been training deepclusterv2 or swav ?
For swav, if the batch size is smaller than the number of prototypes it is then possible that some clusters remain unused.
For deepclusterv2, we do not enforce equipartition so it is posible that some clusters are empty. If this is a problem for you, you can add constraints like reassigning the empty clusters during training for example.
Thanks for the qick response! I'm using swav. I understand that empty clusters exist if the number of prototypes is larger than the batch size, but I thought to solve this problem you introduced the queue? I use the queue like mentioned in your paper after epoch 15.
When using the queue, both the queue features and the batch features are assigned together. I am not sure to understand your question.
Closing for no activity. Feel free to reopen if needed.
For swav, if the batch size is smaller than the number of prototypes it is then possible that some clusters remain unused.
For deepclusterv2, we do not enforce equipartition so it is posible that some clusters are empty. If this is a problem for you, you can add constraints like reassigning the empty clusters during training for example.
I am using SwAV to divide a dataset into two clusters (with 256 batchsize) and I have obtained a very small loss for the clustering, which I would assume it has been properly trained. However, when I tried to perform some cluster analysis using the output
tensor, I found they are pretty similar across the whole dataset that all fall into the same cluster (code is as same as above). Am I doing something wrong?
Hi @mathildecaron31
I trained a network from scratch with my own dataset and wrote some code that sorts images in different folders regarding their cluster assignments. I did this with the following lines of code:
The problem is that when I save the images in different folders regarding their cluster assignment, some folders remain empty. The number of folders is the same as the number of prototypes. I always thought that the images are equally distributed between the different prototypes. What is the problem? Can you help me?