YuanGongND / gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
BSD 3-Clause "New" or "Revised" License
148 stars 25 forks source link

About infer the data #26

Open dyustc opened 1 year ago

dyustc commented 1 year ago

Hi, I am doing the inference as the inference tutorial told. This perhaps is a stupid question, but it said it only has a wav file to infer, so I did , and got the error from kaldi/egs/gop_speechocean762/s5/run.sh, and I did manage to train locally with speachocean762 and generate GOP features with the original dataset. So how should my own dataset look like? ./run.sh utils/validate_data_dir.sh: WARNING: you have only one speaker. This probably a bad idea. Search for the word 'bold' in http://kaldi-asr.org/doc/data_prep.html for more information. utils/validate_data_dir.sh: Error: in data/train, utterance-ids extracted from utt2spk and features utils/validate_data_dir.sh: differ, partial diff is: --- /tmp/kaldi.nkqk/utts 2023-05-16 16:34:52.724637343 +0800 +++ /tmp/kaldi.nkqk/utts.feats 2023-05-16 16:34:52.800640123 +0800

Rtut654 commented 1 year ago

Hey! Have you succeed with running local inference without weird output of too high results (higher than 1.5)?

dyustc commented 1 year ago

@Rtut654 sorry, I kinda of forgot, not too much success here, but I remember running GOP in kaldi would bring a GOP score already.