Creating chunks of Buckeye Corpus

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

MIT License

Section 3.1 (Datasets) of the 2017 Interspeech paper includes the following sentence:

We thus broke up Buckeye into chunks bounded by non-speech (pauses, noise, interviewer speech) of >150 msec...

I am confused here. What does the >150 msec threshold apply to? I can see two readings:

1] The length of the chunks (end_time of the last token - start_time of the first token in the chunk), i.e., chunks with duration < 150 msec are discarded.

2] The non-speech tokens used to separate the chunks, i.e., a non-speech token with duration < 150 msec does not split the ongoing chunk; it is kept inside the chunk and the chunk continues.
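
To make the two readings concrete, here is a minimal Python sketch of each. It is only an illustration of my question, not code from the paper or from MFA: the `Token` class, both function names, the `THRESHOLD` constant, and the toy token list are all hypothetical, and I am assuming each token carries a start time, an end time, and a speech/non-speech flag.

```python
from dataclasses import dataclass
from typing import List

THRESHOLD = 0.150  # seconds; the ">150 msec" from the paper (assumed to be exclusive)


@dataclass
class Token:
    # Hypothetical token representation, not the Buckeye or MFA data model.
    label: str
    start: float  # seconds
    end: float    # seconds
    is_speech: bool


def drop_short_chunks(tokens: List[Token], threshold: float = THRESHOLD) -> List[List[Token]]:
    """Reading 1: split at every non-speech token, then discard short chunks."""
    chunks, current = [], []
    for tok in tokens:
        if tok.is_speech:
            current.append(tok)
        else:
            if current:
                chunks.append(current)
            current = []
    if current:
        chunks.append(current)
    # Chunk duration = end of last token - start of first token.
    return [c for c in chunks if c[-1].end - c[0].start >= threshold]


def split_on_long_nonspeech(tokens: List[Token], threshold: float = THRESHOLD) -> List[List[Token]]:
    """Reading 2: only non-speech tokens longer than the threshold end a chunk."""
    chunks, current = [], []
    for tok in tokens:
        if not tok.is_speech and tok.end - tok.start > threshold:
            # Long non-speech: close the chunk if it contains any speech.
            if any(t.is_speech for t in current):
                chunks.append(current)
            current = []
        else:
            # Speech, or short non-speech that stays inside the chunk.
            current.append(tok)
    if any(t.is_speech for t in current):
        chunks.append(current)
    return chunks


# Toy example: an 80 ms pause followed later by a 500 ms pause.
tokens = [
    Token("so", 0.00, 0.10, True),
    Token("<SIL>", 0.10, 0.18, False),
    Token("yeah", 0.18, 0.40, True),
    Token("<SIL>", 0.40, 0.90, False),
    Token("okay", 0.90, 1.30, True),
]

reading_1 = [[t.label for t in c] for c in drop_short_chunks(tokens)]
reading_2 = [[t.label for t in c] for c in split_on_long_nonspeech(tokens)]
print(reading_1)
print(reading_2)
```

On this toy input the two readings disagree: reading 1 drops the 100 ms chunk "so" and keeps "yeah" and "okay" as separate chunks, while reading 2 keeps "so", the short pause, and "yeah" together in one chunk and splits only at the 500 ms pause.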

Creating chunks of Buckeye Corpus #775