Open seyong92 opened 1 year ago
Hi, Thanks for your interest.
Thank you for the fast comment! I will change the learning rate and share the results after training.
Also, as you said, I found that some of the files were not correctly resampled to 44100 Hz. Thanks!
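For anyone who wants to check their own copy of the data, here is a minimal sketch of how files with the wrong sample rate could be listed. It assumes the audio is stored as PCM `.wav` files that Python's standard `wave` module can read; the function name `find_wrong_sample_rate` is hypothetical and not part of this repo.

```python
import wave
from pathlib import Path


def find_wrong_sample_rate(root, expected_rate=44100):
    """Return (path, rate) for every .wav file under root whose header
    sample rate differs from expected_rate.

    Assumption: files are PCM WAV; other formats (e.g. FLAC) would need
    a different reader.
    """
    mismatched = []
    for path in sorted(Path(root).rglob("*.wav")):
        # Only the header is inspected; no audio samples are decoded.
        with wave.open(str(path), "rb") as wav_file:
            rate = wav_file.getframerate()
        if rate != expected_rate:
            mismatched.append((path, rate))
    return mismatched
```

Calling `find_wrong_sample_rate("path/to/dataset")` with your own dataset directory returns the files that would still need resampling. It only reads the header, so it cannot tell whether a file labeled 44100 Hz was resampled properly.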
Hello, thank you for sharing this valuable code!
I have several questions about the code.
The default training parameters differ from those of the pre-trained model in the repo. The default setting uses 229 mel bins (the same as in the paper), but the pre-trained model uses 300 mel bins. The f_min and f_max values also differ, and I found that the pre-trained model has one more conv layer in PreConvSpec. Do these changes have a meaningful effect on performance?
Also, when I tried training (once with the default parameters and once with the pre-trained model's parameters), both runs showed much lower performance than the pre-trained model (0.7403 for valid F1) and the score reported in the paper. I think the only difference is the batch size, which is 12 in the paper and 2 in the default parameters. Have you ever trained the model with batch size 2, or with the default parameters in this repo?
Again, thank you very much for sharing your code! 😁