How did you train the k-means clustering model on the HuBERT model?

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MIT License

30.61k stars 6.42k forks source link

❓ Questions and Help

My question

Hello, due to my downstream task requirements, I need to perform k-means clustering on the output of Contentvec model, that has the same structure as the HuBERT model but with a different training idea. I have performed feature extraction on my dataset on Contentvec and learnt a clustering model using the code you provided. However I found the clustering to be far less effective than the clustering model you provided for HuBERT.

Do you do any special treatment of the features (such as dimensionality reduction) before training the clustering model? Or maybe my dataset is small in size (7430431* 768)? Or if you can make valuable suggestions for my clustering, I would appreciate it!

facebookresearch / fairseq

How did you train the k-means clustering model on the HuBERT model? #5460

❓ Questions and Help

My question

The code I have tried for clustering：