Jiawei-Yang closed this issue 3 years ago
Hi @Jiawei-Yang, thanks for your kind words and for sharing this finding! I had not noticed that. I think it might explain why using more clusters does not have a big impact on performance. Feel free to post more if you have other findings/analysis :).
Hi, thanks for your brilliant work!
I have found something interesting: many of the learned prototypes are duplicates.
I began with the question "how well do all the learned prototypes scatter?". So I downloaded the best pre-trained model from this link.
Interestingly, when I computed the pairwise cosine similarity among all prototypes, multiple pairs of prototypes had a similarity score of 1 and turned out to be the same.
Here is the code.
So, prototypes #0, #937, #1819, #2363 are in fact identical.
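For readers who want to reproduce this kind of check, here is a minimal sketch of grouping prototypes by pairwise cosine similarity. It is not the snippet from the original post. The function name `duplicate_groups`, the tolerance `tol`, and the toy data are all hypothetical, and the prototypes are assumed to be available as a list of non-zero float vectors (for a real checkpoint you would first extract the prototype weight matrix and would probably want a vectorized version).

```python
import math

def duplicate_groups(prototypes, tol=1e-6):
    """Group row indices whose pairwise cosine similarity is within tol of 1."""
    # L2-normalize each prototype so a dot product equals cosine similarity.
    # Assumes every row has a non-zero norm.
    unit = []
    for row in prototypes:
        norm = math.sqrt(sum(x * x for x in row))
        unit.append([x / norm for x in row])

    groups = []
    assigned = set()
    for i in range(len(unit)):
        if i in assigned:
            continue
        group = [i]
        for j in range(i + 1, len(unit)):
            if j in assigned:
                continue
            cos = sum(a * b for a, b in zip(unit[i], unit[j]))
            if cos >= 1.0 - tol:
                group.append(j)
                assigned.add(j)
        # Only report indices that share a direction with at least one other row.
        if len(group) > 1:
            groups.append(group)
    return groups

# Toy data (hypothetical, not the real checkpoint): rows 0 and 2 are identical.
toy = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.6, 0.8]]
print(duplicate_groups(toy))
```

Note that a cosine similarity of 1 only shows that two vectors point in the same direction; they are identical only if they also have the same norm, so comparing the raw vectors directly is a useful follow-up.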
I wonder whether you have noticed this, and whether you have any idea why this is the case?
Best, Jiawei