vimalmanohar / pychain-old

PyTorch implementation of LF-MMI for End-to-end ASR

Implement DataLoader that can read MFCCs and FSTs #2

Open vimalmanohar opened 5 years ago

vimalmanohar commented 5 years ago

Add a DataLoader class that loads MFCC features and e2e-format FSTs from disk, pairs each features Tensor with its FST, and creates minibatches.

We can follow the way DeepSpeech does this, i.e., use an SCP-like file that lists the file paths, so that the features and FSTs are read from disk only when a minibatch is created. It should also work as a Python iterator that returns a single (features, fst) pair read from disk.
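A minimal sketch of the lazy-loading idea, not a proposed final design. Several things here are assumptions: each line of the SCP-like file is taken to be `<utt_id> <feature_path> <fst_path>`, the class name `ScpPairDataset` and the helper `minibatches` are hypothetical, and the actual readers for MFCCs and FSTs are passed in as callables (`load_features`, `load_fst`), since the on-disk formats are not fixed yet.

```python
from typing import Any, Callable, Iterator, List, Tuple


class ScpPairDataset:
    """Lazily loads (features, fst) pairs listed in an SCP-like index file.

    Assumed (hypothetical) line format: <utt_id> <feature_path> <fst_path>
    """

    def __init__(
        self,
        scp_path: str,
        load_features: Callable[[str], Any],
        load_fst: Callable[[str], Any],
    ) -> None:
        self.load_features = load_features
        self.load_fst = load_fst
        self.entries: List[Tuple[str, str, str]] = []
        # Only the paths are read here; the data itself stays on disk.
        with open(scp_path, encoding="utf-8") as f:
            for line_no, line in enumerate(f, start=1):
                fields = line.split()
                if not fields:
                    continue  # skip blank lines
                if len(fields) != 3:
                    raise ValueError(
                        f"{scp_path}:{line_no}: expected 3 fields, got {len(fields)}"
                    )
                self.entries.append((fields[0], fields[1], fields[2]))

    def __len__(self) -> int:
        return len(self.entries)

    def __getitem__(self, index: int) -> Tuple[Any, Any]:
        # Disk reads happen here, one pair at a time.
        _, feat_path, fst_path = self.entries[index]
        return self.load_features(feat_path), self.load_fst(fst_path)

    def __iter__(self) -> Iterator[Tuple[Any, Any]]:
        for index in range(len(self)):
            yield self[index]


def minibatches(
    dataset: ScpPairDataset, batch_size: int
) -> Iterator[List[Tuple[Any, Any]]]:
    """Group consecutive pairs into lists of at most batch_size items."""
    batch: List[Tuple[Any, Any]] = []
    for pair in dataset:
        batch.append(pair)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly smaller, batch
```

Because the class implements `__len__` and `__getitem__`, it should also be usable as a map-style dataset with `torch.utils.data.DataLoader`. In that case a custom `collate_fn` would likely be needed, since the default collation does not know how to batch FST objects or pad variable-length feature sequences.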

vimalmanohar commented 5 years ago

This is only for testing purposes, so it can be moved outside to keep the loss function as a separate module.