jimbozhang / speechocean762

A non-native English corpus for pronunciation scoring task

Hi, I want to add some words to text-phone from CMU dict and librispeech-lexicon, but they differ from speechocean762 #7

Open YangangCao opened 1 year ago

YangangCao commented 1 year ago

Hi, dear author, I found a problem with the stress marks in the pronunciations.

For example:

In speechocean762/resource/text-phone: 000010011.0 W_B IY0_E

In speechocean762/resource/lexicon.txt: WE W IY0

However, in every version of the CMU dict: WE W IY1

In librispeech-lexicon.txt: WE W IY1

The CMU website says that the digits mark lexical stress: 0 = no stress, 1 = primary stress, 2 = secondary stress.

Many stress marks differ between speechocean762 and the other dictionaries. Which one is correct? Will the different stress marks lead to different GOP scores?
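To get a sense of how widespread the mismatch is, something like the following could list the words whose pronunciations differ only in the stress digit. This is a minimal sketch, not part of the corpus tooling: it assumes both lexicons use the plain `WORD PH1 PH2 ...` line format shown above, the helper names (`parse_lexicon`, `strip_stress`, `stress_only_differences`) are made up for illustration, and the two in-memory lists stand in for the real lexicon files.

```python
import re


def parse_lexicon(lines):
    """Map each word to the set of its pronunciations (tuples of phones).

    Assumes one entry per line in the form 'WORD PH1 PH2 ...'.
    Variant markers such as 'WORD(2)' are not handled in this sketch.
    """
    lexicon = {}
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        word, phones = parts[0].upper(), tuple(parts[1:])
        lexicon.setdefault(word, set()).add(phones)
    return lexicon


def strip_stress(phones):
    """Drop the trailing stress digit (0, 1 or 2) from each phone."""
    return tuple(re.sub(r"\d$", "", p) for p in phones)


def stress_only_differences(lex_a, lex_b):
    """Return words present in both lexicons whose pronunciations differ,
    but become identical once the stress digits are removed."""
    diffs = {}
    for word in sorted(lex_a.keys() & lex_b.keys()):
        a, b = lex_a[word], lex_b[word]
        if a == b:
            continue
        if {strip_stress(p) for p in a} == {strip_stress(p) for p in b}:
            diffs[word] = (sorted(a), sorted(b))
    return diffs


# Toy stand-ins for speechocean762's lexicon.txt and the CMU dict.
# 'WE' is the entry quoted above; 'MARK' is only an illustrative extra entry.
so762 = parse_lexicon(["WE W IY0", "MARK M AA1 R K"])
cmu = parse_lexicon(["WE W IY1", "MARK M AA1 R K"])

diffs = stress_only_differences(so762, cmu)
for word, (a, b) in diffs.items():
    print(word, a, b)
```

With the real files, the lists would be replaced by the lines read from each lexicon. The comparison only reports where the two sources disagree; it says nothing about which stress mark is correct.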

ElsebaiyMohamed commented 8 months ago

@YangangCao

I think the annotation here reflects what the speaker actually said, not what they should have said.