ubclaunchpad / minutes

:telescope: Speaker diarization via transfer learning
https://medium.com/ubc-launch-pad-software-engineering-blog/speaker-diarisation-using-transfer-learning-47ca1a1226f4

Generating spectrograms from given base audio files #106

grig-guz closed 6 years ago

grig-guz commented 6 years ago

:construction_worker: Changes

Given a dictionary with speaker ids as keys and lists of file paths as values, generate a spectrogram dataset.
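For reviewers who want a concrete picture, here is a minimal sketch of that dict-to-dataset step. It is not the code in this PR: the names `read_wav_mono`, `log_spectrogram`, and `build_dataset` are hypothetical, as are the parameters `n_fft`, `hop`, and `clip_seconds`. It assumes 16-bit PCM WAV input, a shared sample rate across files, and fixed-length clips so every spectrogram has the same shape.

```python
import wave

import numpy as np


def read_wav_mono(path):
    """Read a 16-bit PCM WAV file and return (float samples, sample rate)."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        rate = wav.getframerate()
        channels = wav.getnchannels()
        raw = wav.readframes(wav.getnframes())
    samples = np.frombuffer(raw, dtype=np.int16).astype(np.float32) / 32768.0
    if channels > 1:
        # Average the channels to get a mono signal.
        samples = samples.reshape(-1, channels).mean(axis=1)
    return samples, rate


def log_spectrogram(samples, n_fft=512, hop=256):
    """Log-magnitude STFT with shape (n_fft // 2 + 1, n_frames)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(samples) - n_fft) // hop
    frames = np.stack(
        [samples[i * hop:i * hop + n_fft] * window for i in range(n_frames)]
    )
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return np.log1p(magnitude).T


def build_dataset(speaker_files, clip_seconds=1.0):
    """Turn {speaker_id: [wav paths]} into (X, y) arrays.

    Each file is cut into non-overlapping clips of clip_seconds;
    a trailing partial clip is dropped.
    """
    X, y = [], []
    for speaker_id, paths in speaker_files.items():
        for path in paths:
            samples, rate = read_wav_mono(path)
            clip_len = int(clip_seconds * rate)
            for start in range(0, len(samples) - clip_len + 1, clip_len):
                X.append(log_spectrogram(samples[start:start + clip_len]))
                y.append(speaker_id)
    return np.stack(X), np.array(y)
```

Usage would be along the lines of `X, y = build_dataset({"speaker_a": ["a1.wav"], "speaker_b": ["b1.wav"]})`, where the paths are placeholders. Fixed-length clips are a design choice in this sketch so that `np.stack` can produce a single array; variable-length inputs would need padding or a list-based dataset instead.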

:flashlight: Testing Instructions

Get the data, build the dict, and check the resulting shapes! Also, try

plt.imshow(X_train[index], aspect = "auto")

to see if spectrograms look ok.
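The shape check can also be scripted. Below is a rough sketch of one way to do it; `summarize_dataset` is a hypothetical helper, not something this PR adds, and it assumes the dataset is a 3-D array of shape (examples, frequency bins, time frames) with one label per example.

```python
import numpy as np


def summarize_dataset(X, y):
    """Validate a spectrogram dataset and return a small summary dict."""
    X = np.asarray(X)
    y = np.asarray(y)
    if X.ndim != 3:
        raise ValueError(f"expected a 3-D array, got shape {X.shape}")
    if len(X) != len(y):
        raise ValueError("X and y must have the same number of examples")
    if not np.isfinite(X).all():
        raise ValueError("spectrograms contain NaN or infinite values")
    # Count how many examples each speaker contributes.
    labels, counts = np.unique(y, return_counts=True)
    return {
        "n_examples": X.shape[0],
        "spectrogram_shape": X.shape[1:],
        "per_speaker": dict(zip(labels.tolist(), counts.tolist())),
    }
```

Printing `summarize_dataset(X_train, y_train)` before plotting makes it easy to spot an empty speaker or a shape mismatch; `y_train` here is an assumed name for the label array that goes with `X_train`.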

iKevinY commented 6 years ago

@chadlagore implemented a variant of this in 75243456c232e0e5295129b52cf9181f7ec23cd6 as part of the library overhaul in #107.