Closed SarthakYadav closed 4 years ago
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Hello everyone, This is not an issue but rather a question. As seen in the implementation, the feature extraction is done as follows:
n_fft = int(self.sample_rate * self.window_size)
win_length = n_fft
hop_length = int(self.sample_rate * self.window_stride)
D = librosa.stft(audio, n_fft=n_fft, hop_length=hop_length, win_length=win_length, window=self.window)
spect, phase = librosa.magphase(D)
I have two questions:
Any resource to read on these feature extraction methods would be appreciated as well. Thanks!!!