Hack to get 1 dimensional output

fschmid56 / EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

MIT License

218 stars 41 forks source link

I'm not sure why this was needed, but I had to add this hack to get num_classes=1 to work:

<             #num_classes = state_dict['classifier.1.bias'].size(0)
<             num_classes = state_dict['classifier.2.bias'].size(0)
---
>             num_classes = state_dict['classifier.1.bias'].size(0)
313,315d299
<             if "classifier.2.weight" in state_dict:
<                 del state_dict['classifier.2.weight']
<                 del state_dict['classifier.2.bias']

I won't push a fix because I don't understand the impact of this on other users. Perhaps it should only be used when num_classes is 1?

fschmid56 / EfficientAT

Hack to get 1 dimensional output #12