Alexander-H-Liu / End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
MIT License
1.18k stars 317 forks source link

Is there anybody uses custom data using librosa for traning LAS #55

Open qute012 opened 4 years ago

qute012 commented 4 years ago

Hi. I want to change dataset and recognize in korean. I use librosa mel-spectro as extraction features. It shows below. image I didn't modify model parameters and train parameters. But model predicts same class almost one class when i gave some samples.

Below, When i gave sample by using torch.randn, results image It was almost just 2 classes. Is there problems at preprocessing audio file(e.g. mfcc, mel spectro) or models?

anybody knows?