cageyoko / CTC-Attention-Mispronunciation

A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques

A question about the data processing #7

Closed teinhonglo closed 2 years ago

teinhonglo commented 2 years ago

Hi,

There are some mismatches between your phoneme map and the phoneme map in Kaldi.

The differences are as follows:

https://github.com/kaldi-asr/kaldi/blob/master/egs/timit/s5/conf/phones.60-48-39.map#L4
https://github.com/cageyoko/CTC-Attention-Mispronunciation/blob/master/egs/attention_aug/conf/phones.60-48-39.map#L4

https://github.com/kaldi-asr/kaldi/blob/master/egs/timit/s5/conf/phones.60-48-39.map#L16
https://github.com/cageyoko/CTC-Attention-Mispronunciation/blob/master/egs/attention_aug/conf/phones.60-48-39.map#L16

https://github.com/kaldi-asr/kaldi/blob/master/egs/timit/s5/conf/phones.60-48-39.map#L61
https://github.com/cageyoko/CTC-Attention-Mispronunciation/blob/master/egs/attention_aug/conf/phones.60-48-39.map#L61

Any suggestions?

--2022/5/22-- Oh, I got it. The difference comes from the phoneme table used in L2-ARCTIC. Is that correct?

cageyoko commented 2 years ago

I'm so sorry for the delay in replying to your question. Yes, some phonemes in the original version do not appear in L2-ARCTIC, so we replaced them.
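
For anyone who wants to see exactly which rows differ between the two `phones.60-48-39.map` files, here is a minimal sketch of a line-by-line comparison. It assumes each row is a whitespace-separated list of up to three phone labels, and it takes the file contents as strings. The function names `parse_phone_map` and `diff_phone_maps` are hypothetical helpers written for this illustration; they are not part of Kaldi or of this repository.

```python
from itertools import zip_longest


def parse_phone_map(text):
    """Split every line into its whitespace-separated columns.

    Blank lines are kept (as empty lists) so that the reported
    line numbers match the line numbers in the file.
    """
    return [line.split() for line in text.splitlines()]


def diff_phone_maps(text_a, text_b):
    """Return (line_number, row_a, row_b) for every row that differs.

    If one file is shorter, its missing rows are reported as None.
    """
    rows_a = parse_phone_map(text_a)
    rows_b = parse_phone_map(text_b)
    diffs = []
    for n, (a, b) in enumerate(zip_longest(rows_a, rows_b), start=1):
        if a != b:
            diffs.append((n, a, b))
    return diffs
```

With both map files downloaded locally, passing their contents (for example via `open(path).read()`) to `diff_phone_maps` should list the line numbers where the two maps disagree, which can then be checked against the lines linked above.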