YuanGongND / gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
BSD 3-Clause "New" or "Revised" License

Hi, I want to add some words to text-phone from CMU dict and librispeech-lexicon, but they differ from speechocean762 #28

Closed by YangangCao 1 year ago

YangangCao commented 1 year ago

Hi, dear author, I want to run inference on some of my own data, but some words don't exist in speechocean762/resource/lexicon.txt or speechocean762/resource/text-phone, so I want to add them. However, I ran into a problem.

For example:

In speechocean762/resource/text-phone: 000010011.0 W_B IY0_E

In speechocean762/resource/lexicon.txt: WE W IY0

However, in every version of the CMU dict: WE W IY1

In librispeech-lexicon.txt: WE W IY1

The CMU website says that the digits mark lexical stress: 0 = no stress, 1 = primary stress, 2 = secondary stress.
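To make the notation concrete, here is a minimal sketch that splits a phone token such as `IY0_E` into its base phone, stress digit, and word-position suffix. The function name `split_phone` is hypothetical, and the set of suffixes is an assumption: only `_B` and `_E` appear in the example above, while `_I` and `_S` are added on the assumption that text-phone follows the usual Kaldi word-position convention.

```python
import re

# Base phone in capital letters, an optional stress digit (0, 1 or 2),
# and an optional word-position suffix.
# Assumption: _I and _S are included by analogy with Kaldi's usual
# word-position markers; only _B and _E appear in the example above.
PHONE_RE = re.compile(r"^([A-Z]+)([012])?(?:_([BIES]))?$")


def split_phone(token):
    """Return (base, stress, position); stress and position may be None."""
    match = PHONE_RE.match(token)
    if match is None:
        raise ValueError(f"unrecognized phone token: {token!r}")
    base, stress, position = match.groups()
    return base, (int(stress) if stress is not None else None), position


print(split_phone("IY0_E"))  # text-phone style token
print(split_phone("W_B"))    # consonant, no stress digit
print(split_phone("IY1"))    # CMU dict style token
```

With this split, `IY0_E` and `IY1` can be seen to share the base phone `IY` and differ only in the stress digit.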

Many stress markers differ between speechocean762 and the other dictionaries. Which one is correct?
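One way to measure how widespread the mismatch is would be to list the words whose pronunciations agree once the stress digits are removed but never match exactly. The sketch below assumes both lexicons use the plain `WORD PHONE PHONE ...` layout shown above; the helper names are hypothetical, and alternate-pronunciation keys such as `WORD(1)` are not normalized.

```python
def load_lexicon(lines):
    """Map each word to a list of pronunciations (each a list of phones)."""
    lexicon = {}
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        lexicon.setdefault(parts[0], []).append(parts[1:])
    return lexicon


def strip_stress(phones):
    """Drop trailing stress digits, e.g. IY0 -> IY."""
    return [p.rstrip("012") for p in phones]


def stress_only_differences(lex_a, lex_b):
    """Words present in both lexicons that differ only in stress digits."""
    diffs = {}
    for word in sorted(lex_a.keys() & lex_b.keys()):
        prons_a, prons_b = lex_a[word], lex_b[word]
        exact = any(pa == pb for pa in prons_a for pb in prons_b)
        same_base = any(
            strip_stress(pa) == strip_stress(pb)
            for pa in prons_a
            for pb in prons_b
        )
        if same_base and not exact:
            diffs[word] = (prons_a, prons_b)
    return diffs


# Entries taken from the example in this issue; real use would pass the
# lines of lexicon.txt and of the CMU dict file instead.
so762 = load_lexicon(["WE W IY0"])
cmu = load_lexicon(["WE W IY1"])
print(stress_only_differences(so762, cmu))
```

This only reports where the two sources disagree; it does not say which stress marking the GOP recipe expects.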

YangangCao commented 1 year ago

@LyWangPX

YuanGongND commented 1 year ago

hi there,

I think this question is better directed to the Kaldi GOP issues: https://github.com/kaldi-asr/kaldi/issues?q=is%3Aissue+gop.

I closely followed the original GOP recipe to get the GOP features, which are also released. We do not propose anything new in GOP feature generation, and you should be able to reproduce the features by using a late 2021 commit of the Kaldi GOP recipe and the SO762 dataset.

-Yuan