Open TATEXH opened 5 months ago
Hi, for music we used This audio is a <genre> song
. I think the task you are dealing with is also a bit of out of distribution of the training data. I don't think we included a lot of music with mood labels in music version of the CLAP.
Best,
Thanks for the reply. I will try it with your text.
Hi I am using music_audioset_epoch_15_esc_90.14.pt as a music classifier. I would like to classify the mood and genre of our music files. I am trying to find the cosine similarity using the text "The mood of this song is (romantic, energetic, etc)" but I only get about 0.4. I think that if I use a text similar to the one you used in your training, the value will be better, so could you please tell me what type of text you used?