-
Hi,
When I tried to extract fbank features I found missing frequencies. So, I checked the filter bank in your code and found this:
![image](https://user-images.githubusercontent.com/24974144/62096…
-
The variable-Q transform is very similar to the constant-Q transform, except that the Q value gets lower as the frequency decreases, which is useful if you want better time resolution at lower frequencies at …
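
To make the "Q gets lower as the frequency decreases" point concrete, here is a minimal sketch. It assumes one common formulation in which the filter bandwidth is `alpha * f + gamma`; the function name `q_factors`, the `gamma` parameter, and the choice of `alpha` are illustrative assumptions, not taken from this post. With `gamma = 0` the Q value is constant; with `gamma > 0` it shrinks toward low frequencies.

```python
def q_factors(freqs_hz, bins_per_octave=12, gamma=0.0):
    """Return Q = f / bandwidth for each center frequency.

    Assumption: bandwidth B(f) = alpha * f + gamma, with alpha chosen from
    the number of bins per octave. gamma = 0 gives a constant-Q bank.
    """
    alpha = 2.0 ** (1.0 / bins_per_octave) - 2.0 ** (-1.0 / bins_per_octave)
    return [f / (alpha * f + gamma) for f in freqs_hz]


freqs = [55.0, 110.0, 220.0, 440.0]
q_constant = q_factors(freqs, gamma=0.0)   # same Q for every frequency
q_variable = q_factors(freqs, gamma=20.0)  # lower Q at lower frequencies
```

A lower Q at low frequencies means wider, shorter filters there, which is where the improved time resolution comes from.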
-
What's the best way to calculate loudness per frame? And can we make it a function?
## TL;DR
Is it this?
``` python
S = abs(FFT(y))**2 # power spectrogram
weighting = A_weighting # weighti…
-
Hi, I study auditory processing and it would be very useful to be able to use the linearRampToValue method for oscillator frequency on a "mel scale".
https://en.wikipedia.org/wiki/Mel_scale
This…
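
For illustration, a ramp that is linear on the mel scale could be sketched as below. This assumes the common `2595 * log10(1 + f / 700)` mel formula from the Wikipedia page linked above; the helper names (`hz_to_mel`, `mel_to_hz`, `mel_linear_ramp`) are hypothetical and not part of any existing API.

```python
import math


def hz_to_mel(f_hz):
    """Convert Hz to mel (assumed formula: 2595 * log10(1 + f / 700))."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_linear_ramp(f_start_hz, f_end_hz, n_steps):
    """Return n_steps frequencies in Hz, equally spaced on the mel scale."""
    if n_steps < 2:
        raise ValueError("n_steps must be at least 2")
    m0, m1 = hz_to_mel(f_start_hz), hz_to_mel(f_end_hz)
    return [mel_to_hz(m0 + (m1 - m0) * i / (n_steps - 1)) for i in range(n_steps)]


ramp = mel_linear_ramp(200.0, 2000.0, 5)
```

Each value in `ramp` could then be applied as the oscillator frequency at successive, equally spaced time steps.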
-
### Describe the bug
The following auxiliary function is used in [Filterbank](https://speechbrain.readthedocs.io/en/latest/_modules/speechbrain/processing/features.html#Filterbank) for calculation …
-
I've theorized that images 128 pixels in height are not enough to accurately represent all frequencies, leading to a dissonant sound. If we tuned the mel spectrogram processing to create images of siz…
-
Dear author,
I was trying to understand how the sinc-layer in your code works. Could you please explain two lines in this part:
```
# initialize filterbanks using Mel scale
NFFT = 51…
-
As of now the mel fb is triangular, chroma fb is Gaussian. But, sometimes people use triangular fbs for computing chroma, and so on. I propose that instead of melfb and chromafb we have triangularfb…
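
As a rough sketch of what a scale-independent triangular filterbank might look like: the function below takes the band edges as input, so the same code could serve mel, chroma, or any other spacing. The name `triangular_fb` and its signature are hypothetical, not an existing function in this project.

```python
def triangular_fb(edges, freqs):
    """Build triangular filters from a list of band edges.

    edges: strictly increasing frequencies, length n_filters + 2.
           Filter i rises from edges[i] to a peak of 1 at edges[i + 1]
           and falls back to 0 at edges[i + 2].
    freqs: frequencies of the spectrum bins to weight.
    Returns a list of n_filters rows, each with len(freqs) weights.
    """
    if len(edges) < 3:
        raise ValueError("need at least 3 edges for one filter")
    if any(b <= a for a, b in zip(edges, edges[1:])):
        raise ValueError("edges must be strictly increasing")
    fb = []
    for lo, ctr, hi in zip(edges, edges[1:], edges[2:]):
        row = []
        for f in freqs:
            rising = (f - lo) / (ctr - lo)
            falling = (hi - f) / (hi - ctr)
            # Take the lower of the two slopes and clip negatives to zero
            row.append(max(0.0, min(rising, falling)))
        fb.append(row)
    return fb


fb = triangular_fb([0.0, 100.0, 200.0, 300.0],
                   [0.0, 50.0, 100.0, 150.0, 200.0, 250.0, 300.0])
```

The spacing (mel, chroma, linear) then lives entirely in how `edges` is computed, and the filter shape could be swapped independently.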
-
Hi, I've been using the noise reduction algorithm to standardize the noise before feeding the signal into a deep learning model. The issue I found while doing the noise reduction is a slightly frequency…
-
### First Steps Update
- [x] **Project Initialization**: Set up the repository and created the initial README file.
- [ ] **Data Organization**: Collected and organized Telugu music into four catego…