magic-research / Dataset_Quantization

[ICCV2023] Dataset Quantization
252 stars 19 forks source link

visual method #11

Open maomaocun opened 2 months ago

maomaocun commented 2 months ago

What is the method that makes the visual of the feature of bins?T-Sne?

vimar-gu commented 2 months ago

Yes, we used t-SNE for the bin visualization. The sample indices of each bin are recorded and emphasized in different figures.

maomaocun commented 2 months ago

Can I get the specific source code? I tried both the dataset images and the features generated by the pre-trained model, but I found that neither was very obvious.

image image
vimar-gu commented 2 months ago

Please try visualizing samples from one single class, which is also the case presented in the paper. Running t-SNE on a large number of samples will decrease the quality of dimension reduction.

maomaocun commented 2 months ago
image image

It looks a bit strange, but it seems to reflect the characteristics of the data set distribution. Do you have any guidance? I performed t-SNE operation on the gradient generated by the pre-trained model of the data set, and the final image was normalized. The t-SNE parameters are as follows

image
vimar-gu commented 2 months ago

You should simply use the extracted features to perform t-SNE. The figures you presented seem kind of strange. There seem to be two groups of features. But for bin selection, there only exist real images, so there should not be such distribution gap.

I'm sorry I don't have the original script now. But the general procedure should be:

Do make sure that the sample order is consistent when performing bin selection and figure drawing.

maomaocun commented 2 months ago
image image

bin The first picture is bin1, and the second picture is bin10. It seems to be a little closer to the meaning of your paper.

vimar-gu commented 2 months ago

Thanks for the feedback. But the figures are still not correct.

At the bin selection stage, onyl original images are selected. It means that the red dots (selected samples) should exactly cover the corresponding blue dots (original samples). And the red dots in the first figure should be of blue color in the other figures. But the presented two figures don't meet the principle. The features of a class should be processed through t-SNE all together. Then only the selected indices are assigned with a different color. Please check the whole process again and see if the red dots are based on different features.