how to format h5 files for input?

jefflai108 / Contrastive-Predictive-Coding-PyTorch

Contrastive Predictive Coding for Automatic Speaker Verification

MIT License

472 stars 96 forks source link

how to format h5 files for input? #8

Closed kachiem closed 4 years ago

kachiem commented 4 years ago

Hi, thanks for sharing your implementation of CPC. I've been trying to run it out of the box but am having issues shaping the input data correctly. Is there another script that encodes the wav file directories into .h5?

jefflai108 commented 4 years ago

What's the error? I think h5py is pretty standard

kachiem commented 4 years ago

I'm wondering if I'm missing some structure for either the .h5 files or for .txt files. Currently my training lists are txt files containing a list of .wav file ids. My h5's are also a single group of wav files from train, test, and dev sets. I tried to preserve the hierarchical structure of the LibriSpeech corpus.

jefflai108 commented 4 years ago

your key name is incorrect. You may need to modify the original script.