Open Vichoko opened 4 years ago
Hi, if you check the paper by Leglaive et al., which is referenced in readme.md, you will see that they consider the singing voice to lie in between the percussive and harmonic components. I think that makes sense, since consonants, for instance, are more percussive than harmonic. Also, singing style varies, so it is hard to assume that the double harmonic component will capture every singing style. But if you are not reproducing the paper, you are always free to experiment with different methods. :)
I've been thinking a lot about this code fragment in https://github.com/kyungyunlee/ismir2018-revisiting-svd/blob/master/leglaive_lstm/audio_processor.py, in the function process_single_audio (Compute double stage HPSS for the given audio file), lines 24-33:
The `D2_harmonic` and `D2_percussive` are calculated from the `D_percussive` component. Is this right? I'm currently checking the original paper and I will keep you updated if I discover something.
This seems kinda odd, since my intuition says that the harmonic component is more important for voice activity detection.
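To make the flow under discussion concrete, here is a minimal NumPy-only sketch of a double-stage HPSS in which the second stage runs on the percussive output of the first. It is not the repository's code: the helpers `stft`, `istft`, `median_filter_1d`, `hpss`, and `double_stage_hpss` are hypothetical, and the window lengths (4096 and 512), the hop of a quarter window, the kernel size of 17, and the binary masking are illustrative assumptions rather than values taken from the repo or the paper.

```python
import numpy as np

def stft(y, n_fft, hop):
    """Hann-windowed STFT, shape (n_fft // 2 + 1, n_frames). No padding."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(y) - n_fft) // hop
    frames = np.stack(
        [y[i * hop : i * hop + n_fft] * window for i in range(n_frames)], axis=1
    )
    return np.fft.rfft(frames, axis=0)

def istft(D, n_fft, hop):
    """Overlap-add inverse of `stft`, normalised by the summed squared window."""
    window = np.hanning(n_fft)
    frames = np.fft.irfft(D, n=n_fft, axis=0) * window[:, None]
    n_frames = D.shape[1]
    length = n_fft + hop * (n_frames - 1)
    y = np.zeros(length)
    norm = np.zeros(length)
    for i in range(n_frames):
        y[i * hop : i * hop + n_fft] += frames[:, i]
        norm[i * hop : i * hop + n_fft] += window ** 2
    return y / np.maximum(norm, 1e-8)

def median_filter_1d(mag, size, axis):
    """Sliding median of odd length `size` along `axis`, with edge padding."""
    pad = size // 2
    pad_width = [(0, 0), (0, 0)]
    pad_width[axis] = (pad, pad)
    padded = np.pad(mag, pad_width, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, size, axis=axis)
    return np.median(windows, axis=-1)

def hpss(D, kernel=17):
    """Median-filtering HPSS with binary masks (assumed variant)."""
    mag = np.abs(D)
    harm = median_filter_1d(mag, kernel, axis=1)  # smooth along time
    perc = median_filter_1d(mag, kernel, axis=0)  # smooth along frequency
    mask_h = harm >= perc
    return D * mask_h, D * ~mask_h

def double_stage_hpss(y, n_fft_long=4096, n_fft_short=512):
    """Second stage is computed from the first-stage percussive component."""
    hop1 = n_fft_long // 4
    D_harmonic, D_percussive = hpss(stft(y, n_fft_long, hop1))
    # Resynthesise the percussive part, then re-analyse at another resolution.
    y_percussive = istft(D_percussive, n_fft_long, hop1)
    hop2 = n_fft_short // 4
    D2_harmonic, D2_percussive = hpss(stft(y_percussive, n_fft_short, hop2))
    return D_harmonic, D2_harmonic, D2_percussive
```

With this structure, anything the first stage labels as harmonic never reaches the second stage, which is exactly the behaviour being asked about. Swapping `D_percussive` for `D_harmonic` in `double_stage_hpss` would be the one-line change to try if you want to test the other intuition.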