Issue opened by alexis-michaud (Oct. 2018)
This relates to #214, in that the word boundary in the training corpus is marked by a space.
"it's important that if users want to explictly predict spaces (in character prediction), then that is accounted for. Probably best with a flag to segment_into_chars() or something similar, which would generate special tokens that represent spaces, such as underscores, for training and decoding. These then would get removed as a postprocessing step."
Since 2018, the model for Na has included tone-group boundaries, but as of now (Oct. 2018) it still disregards word boundaries. A look at story-fold cross-validation materials suggests that longer words have somewhat different acoustic properties, so adding word boundaries to the training data could benefit phoneme and tone recognition.
A first step (suggested by @oadams) could be to produce separate error rates for short words versus longer words, using the word segmentation in the reference transcription as a guide.
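One way to carry out this first step might be: align the hypothesis to the reference at the symbol level, attribute each error to the reference word it falls in, and report an error rate per word-length bucket. A minimal sketch, assuming a non-empty reference; the function names, the `short_max` threshold, and the attribution of insertions to the nearest reference word are all illustrative choices, not an existing implementation.

```python
def levenshtein_ops(ref, hyp):
    """Standard Levenshtein alignment with backtrace.

    Returns a list of (op, ref_index) pairs, op in
    {"match", "sub", "del", "ins"}; insertions are attributed
    to the nearest reference symbol (an arbitrary choice here).
    """
    n, m = len(ref), len(hyp)
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dp[i][0] = i
    for j in range(m + 1):
        dp[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,
                           dp[i][j - 1] + 1,
                           dp[i - 1][j - 1] + cost)
    ops, i, j = [], n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                dp[i][j] == dp[i - 1][j - 1] + (0 if ref[i - 1] == hyp[j - 1] else 1)):
            ops.append(("match" if ref[i - 1] == hyp[j - 1] else "sub", i - 1))
            i, j = i - 1, j - 1
        elif i > 0 and dp[i][j] == dp[i - 1][j] + 1:
            ops.append(("del", i - 1))
            i -= 1
        else:
            ops.append(("ins", max(i - 1, 0)))
            j -= 1
    return list(reversed(ops))

def error_rates_by_word_length(ref_words, hyp_symbols, short_max=2):
    """Per-bucket error rates: 'short' words (<= short_max symbols)
    versus 'long' words, using the reference word segmentation as
    a guide. ref_words is a list of words, each a list of symbols."""
    ref = [p for w in ref_words for p in w]
    word_of = []  # map each reference-symbol index to its word index
    for wi, w in enumerate(ref_words):
        word_of.extend([wi] * len(w))
    counts = {"short": [0, 0], "long": [0, 0]}  # [errors, ref symbols]
    for op, ri in levenshtein_ops(ref, hyp_symbols):
        bucket = "short" if len(ref_words[word_of[ri]]) <= short_max else "long"
        if op != "match":
            counts[bucket][0] += 1
        if op != "ins":
            counts[bucket][1] += 1
    return {b: e / max(r, 1) for b, (e, r) in counts.items()}
```

With reference words `[["a","b"], ["c","d","e"]]` and hypothesis `["a","b","c","x","e"]`, the single substitution falls in the three-symbol word, giving a short-word error rate of 0 and a long-word error rate of 1/3.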
(Suggested label for this Issue: Yongning Na)