Hi @sanchit-gandhi,
I think the concatenation strategy is excellent, but I have a question.
When concatenate_audio=True, I believe the wer_threshold filtering needs to be applied during the pseudo-labelling phase rather than deferred to the training phase. Many short audio segments may be very noisy on their own, with a high individual WER, but once they are concatenated with cleaner segments the WER of the combined sample may fall below the threshold, so the noisy segments are never filtered out.
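To illustrate the concern, here is a minimal, self-contained sketch. It does not use the actual distil-whisper code: the `wer` helper, the example transcripts, and the percent-scale threshold of 10 are all assumptions made up for this illustration.

```python
def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(ref_text, hyp_text):
    """Word error rate in percent (hypothetical helper, not the repo's metric)."""
    ref = ref_text.split()
    hyp = hyp_text.split()
    return 100.0 * edit_distance(ref, hyp) / max(len(ref), 1)


# Hypothetical (ground-truth, pseudo-label) pairs for three short segments.
# The middle segment stands in for a noisy clip that was transcribed badly.
segments = [
    ("the cat sat on the mat by the front door",
     "the cat sat on the mat by the front door"),
    ("uh okay", "you are"),
    ("we will meet again tomorrow morning at the train station",
     "we will meet again tomorrow morning at the train station"),
]
wer_threshold = 10.0  # assumed to be on a percent scale

# Filtering per segment: the noisy middle segment is dropped.
per_segment = [wer(ref, hyp) for ref, hyp in segments]
kept = [seg for seg, w in zip(segments, per_segment) if w <= wer_threshold]

# Filtering after concatenation: the noisy segment is diluted and survives.
concat_ref = " ".join(ref for ref, _ in segments)
concat_hyp = " ".join(hyp for _, hyp in segments)
concat_wer = wer(concat_ref, concat_hyp)

print(per_segment)           # [0.0, 100.0, 0.0]
print(round(concat_wer, 2))  # 9.09, below the threshold of 10
```

In this toy case the middle segment has a WER of 100% on its own, but the concatenated sample comes in at about 9%, so a threshold applied only after concatenation would keep it.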