flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit
https://github.com/facebookresearch/wav2letter/wiki

Flac Issue in create.lua #10

Closed DenisPeskoff closed 6 years ago

DenisPeskoff commented 6 years ago

Background/Use Case: At the moment, I'm only interested in using it for decoding with a pre-trained model. (Ideally I'd also extract the lattice/confidences, if that is possible with your flags.)

System: Mac El Capitan, CPU

Assumptions Made in Installation:

- Skipped the MKL install, since I'm not training.
- Installed Boost through brew install.
- Skipped the MPI installs, again because I'm not planning on training.
- Skipped the CUDNN and CUNN installs, since I'm on CPU (test.lua currently requires these, but I commented the relevant parts out).

I'm currently stuck at the "Training wav2letter models" step.

luajit issue with flac data: I thoroughly followed the installation instructions per the README. To ultimately run the decoder, I need a librispeech-proc/letters.lst file, which I assume is generated by:

luajit ~/wav2letter/data/librispeech/create.lua...

However, after downloading and unpacking everything as required, this command reports that my .flac files are in an "unimplemented format".

As far as I understood, downloading all of the LibriSpeech data is needed to decode, since you need to create letters.lst. Perhaps you could upload a letters.lst file for LibriSpeech, and then none of this will be necessary?

Thank you!

saisrinivas047 commented 6 years ago

@DenisPeskoff

I have the same problem. Can you tell me how you got the librispeech-proc folder? PS: I am running the pretrained model on a single CPU.

VitaliyLi commented 6 years ago

I've seen the "unimplemented format" error when libsndfile is missing the libraries it needs to process flac files. letters.lst is simply a list of all English letters plus the apostrophe and silence ( ) (I don't want to upload letters.lst, as it is dataset specific):

' a b c d e f g h i j k l m n o p q r s t u v w x y z
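If it helps, a token list like the one above can be written out with a few lines of Python. This is only a rough sketch, not what create.lua does: the one-token-per-line layout is an assumption, the helper names `build_letters` and `write_letters` are made up for illustration, and the silence token is left out because its symbol is not shown here. Any extra tokens a dataset needs can be passed in through the hypothetical `extra_tokens` argument.

```python
import string


def build_letters(extra_tokens=()):
    """Return the apostrophe, the letters a-z, then any extra tokens.

    Illustrative helper, not part of wav2letter. Duplicates among the
    extra tokens are skipped so each token appears only once.
    """
    tokens = ["'"] + list(string.ascii_lowercase)
    for token in extra_tokens:
        if token not in tokens:
            tokens.append(token)
    return tokens


def write_letters(path, tokens):
    """Write one token per line (assumed layout) to the given path."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(tokens) + "\n")
```

Calling `write_letters("letters.lst", build_letters())` would produce a 27-line file; compare it against a file generated by create.lua before relying on it.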

DenisPeskoff commented 6 years ago

Thanks for that list. I believe the digits (0-9) have to be added as well.

I have the proper library installed, but the flac files are still not seen (I went through and relinked just to be sure). test.lua can see the flac file if the file name is changed to match it.
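One way to narrow this down is to rule out the files themselves before looking further at the library. The sketch below is a quick sanity check and not part of wav2letter: it relies only on the fact that a native FLAC stream starts with the four bytes `fLaC`, and the helper names `looks_like_flac` and `find_non_flac` are made up for illustration. A file with a tag block prepended in front of the marker would be flagged even though it may still be decodable, so treat any hits as a hint rather than proof.

```python
from pathlib import Path

# Marker that opens every native FLAC stream.
FLAC_MAGIC = b"fLaC"


def looks_like_flac(path):
    """Return True if the file begins with the FLAC stream marker."""
    with open(path, "rb") as handle:
        return handle.read(4) == FLAC_MAGIC


def find_non_flac(root):
    """Return the *.flac files under root that do not start with the marker."""
    return sorted(
        candidate
        for candidate in Path(root).rglob("*.flac")
        if not looks_like_flac(candidate)
    )
```

If `find_non_flac` returns an empty list for the unpacked LibriSpeech folder, the files are most likely fine and the problem is on the library or path side.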