Closed RicherMans closed 1 year ago
Hey Heinrich,
thanks for pointing this out so quickly. Indeed there was a problem in the _fname_toidx.pkl file, which was caused by the concatenation of the balanced and unbalanced subsets of the datasets. I apologize for the inconvenience, the problem should be fixed now.
Best, Florian
Hey Florian, thanks for the quick answer, the labels all look good now and I can train my models.
Best, Heinrich
I would be interested in how well they work for you compared to what you were previously using for KD if you have some results.
Hey Florian, so it works I guess as advertised :).
I generally only use MobileNetV2 (with decision mean pooling) with 64 mels and a sampling rate of 16k, which is quite different from your teacher setting. Because of your paper, I also used some ViT models of my own, which were trained using the exact same features as above. I also ran experiments for global average pooling (GAP) for MBv2 just for comparison's sake. Results are:
Model | Teacher | mAP |
---|---|---|
MBv2-DM | None | 42.15 |
MBv2-DM | My-ViT | 43.51 |
MBv2-GAP | EfficientKD | 42.15 |
MBv2-DM | EfficientKD | 43.53 |
So results look alright, given the small size of the MobileNetV2 model, so thanks for that.
Hey Florian, I checked out your new fname_to_index file and trained some models, but performance is extremely bad.
Then just proceeded to print some scores from your provided
fname_to_idx.pkl
file and check out the ground truth.I used this simple code to map your indexes to the fnames:
Some of the outputs are:
The ground truth for that file however, is "Music", you can check out the source at:
https://youtu.be/09c885WMtMw?t=80
Another sample is:
Which can be viewed here: https://youtu.be/09bFB0X-8QY?t=16
I'm reasonably confident that your fname_to_index is somewhat wrong, could you maybe check it out if its the case?
EDIT:
In the above code snippet, the following will throw an error, which means there are some duplicate indexes:
Kind Regards, Heinrich