How can we do this on our own custom dataset?

anjandeepsahni / automatic_speech_recognition

Speech to text transcription using RNN (Listen, Attend and Spell).

MIT License


Open razasaddiqi opened 4 years ago

razasaddiqi commented 4 years ago

Can you please tell me what the structure of the dataset should be to feed into this model, and which features you extracted from the audio (mel spectrogram, filterbank, MFCC, etc.)?
How would I map each recording to its relevant transcript characters?

Ishu-dev commented 8 months ago

Please add a reply explaining what the structure of the data should be.
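
On the question of mapping each recording to its transcript characters: character-level models such as Listen, Attend and Spell are commonly trained on pairs of (audio features, sequence of character indices). The sketch below shows only the transcript side of that idea. It is not the repository's confirmed data format; the file paths, the special tokens `<pad>`, `<sos>`, `<eos>`, and the helper names `build_vocab`, `encode`, and `decode` are all assumptions made for illustration. Audio feature extraction would be a separate step and is not shown.

```python
# Hypothetical example data: (audio path, transcript) pairs. The paths and
# texts are made up for illustration; this is not the repository's format.
pairs = [
    ("audio/utt_0001.wav", "hello world"),
    ("audio/utt_0002.wav", "speech to text"),
]

# Assumed special tokens: padding, start-of-sequence, end-of-sequence.
SPECIALS = ["<pad>", "<sos>", "<eos>"]


def build_vocab(transcripts):
    """Map every special token and every distinct character to an integer id."""
    chars = sorted({ch for text in transcripts for ch in text})
    return {tok: i for i, tok in enumerate(SPECIALS + chars)}


def encode(text, vocab):
    """Turn a transcript into a list of ids wrapped in <sos> ... <eos>."""
    return [vocab["<sos>"]] + [vocab[ch] for ch in text] + [vocab["<eos>"]]


def decode(ids, vocab):
    """Turn a list of ids back into text, dropping the special tokens."""
    inv = {i: tok for tok, i in vocab.items()}
    return "".join(inv[i] for i in ids if inv[i] not in SPECIALS)


vocab = build_vocab(text for _, text in pairs)

# Each recording is paired with the character ids of its own transcript.
dataset = [(path, encode(text, vocab)) for path, text in pairs]
```

Each entry in `dataset` keeps the audio path next to its encoded transcript, so a data loader could compute features for the audio file and use the id list as the target sequence.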