Closed chadlagore closed 6 years ago
What code is generating the two spectrograms that are in your PR description? I'm curious why so much of the second one is purple (what information is it actually encoding, compared to the first?) 😮
https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.spectrogram.html mode="phase"
produces the first spectrogram. I agree that the fact that the purple spectrogram learns is rather surprising. #114 will allow users to configure this a bit more.
Handles #103 #108 #111
:construction_worker: Changes
matplotlib
spectrograms withscipy.signal.spectrogram
which produces a flat (2d spectrogram). This will likely reduce training time at (hopefully) no accuracy cost.pipenv
!!! :tada:Typical Usage
Rebuilding the Base Model With New Speakers
:flashlight: Testing Instructions
For now,
Lets keep the coverage up!
Repo Layout
Base Model(s)
minutes/models/cnn.5
on flat spectrograms (no channels).BaseModel.fit
method.Dropout
, it achieved > 95% validation accuracy, but performed far worse in a transfer situation. This makes sense, considering dropout reduces overfitting to the current speakers!Flac-Wav Conversion
Rather crudely, I did something like:
This is obviously not an option for the library. We should find a way to read in
.flac
files properly (#110).Other Notes