This is an end-to-end speech recognition neural network, deployed in Keras. It was my final project for the Artificial Intelligence Nanodegree at Udacity.
How does performance change with the length of the audio? #1
Is this method better suited to short bursts of audio, or can it also be used to transcribe longer recordings, say 30-40 minutes?
That is, does performance degrade considerably when it is applied to longer audio files?