ParlaSpeech data preparation procedure

This repository demonstrates the procedure and utilities used to automatically process large amounts of speech data in order to create a corpus which can be used to train models for speech processing, for example in automatic speech recognition.

The examples used here are based on the corpus of croatian parliamentary speech distributed using this link: http://hdl.handle.net/11356/1494