Closed kkm000 closed 5 years ago
@danpovey, quick questions if/when you have time.
make_utt2dur.sh converts utt2num_frames only as the last resort, i.e. only in case wav.scp is not present; otherwise it may run the whole pipeline. Is it sensible to prefer creating utt2dur from utt2num_frames, if it is present? The only downside I see is the lower precision, down to frame shift, i.e. 10ms typically. Should be acceptable, I think?
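For illustration, the conversion in question could look roughly like the sketch below. The function name frames_to_dur and the default frame shift of 0.01 s are assumptions made for this example, not the code that make_utt2dur.sh actually contains. It approximates each duration as the frame count times the frame shift, which is where the precision loss mentioned above comes from.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not taken from make_utt2dur.sh).
# Input lines have the form: <utt-id> <num-frames>
# Output lines have the form: <utt-id> <duration-in-seconds>
# The duration is approximated as num_frames * frame_shift, so it is
# only accurate to about one frame shift.
frames_to_dur() {
  local utt2num_frames=$1
  local frame_shift=${2:-0.01}   # assumed default: 10 ms
  awk -v fs="$frame_shift" '{ print $1, $2 * fs }' "$utt2num_frames"
}
```

It would be called as, e.g., frames_to_dur data/train/utt2num_frames 0.01 > data/train/utt2dur, where data/train stands for any data directory.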
Related to feature extraction in general. When I create data directories, I always place a copy of the feature config file that was used to extract features alongside feats.scp, under a name matching the feature type (e.g. mfcc.conf, even if conf/mfcc_hires.conf was used). It helps me a ton to avoid mixing up variously extracted features. Do you think we should just always do that?
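A minimal sketch of that habit, assuming a hypothetical helper save_feat_conf that is not part of Kaldi: it copies whichever config was used into the data directory under a name derived from the feature type only.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not an existing Kaldi script).
# Copies the config used for feature extraction next to feats.scp,
# named after the feature type rather than the original file name.
save_feat_conf() {
  local conf=$1        # e.g. conf/mfcc_hires.conf
  local data=$2        # data directory that holds feats.scp
  local feat_type=$3   # e.g. mfcc
  cp "$conf" "$data/$feat_type.conf"
}
```

For example, save_feat_conf conf/mfcc_hires.conf data/train mfcc would leave data/train/mfcc.conf next to feats.scp.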
utils/data/get_frame_shift.sh first blindly creates utt2dur (expensive!), and only then checks if the file frame_shift exists. I'll rearrange the order, indeed, but a better solution seems to be to always create the file frame_shift along with the features?
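The reordering could be sketched as below. The function read_frame_shift is a hypothetical name used only for this example; it looks at the cheap frame_shift file first and signals failure when it is missing, so that the caller can decide whether to fall back to the expensive path.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not the actual get_frame_shift.sh code).
# Prints the contents of <data-dir>/frame_shift if that file exists
# and is non-empty; otherwise returns 1 without computing anything.
read_frame_shift() {
  local data=$1
  if [ -s "$data/frame_shift" ]; then
    cat "$data/frame_shift"
  else
    return 1
  fi
}
```

A caller would try this first and compute utt2dur only when it fails, which avoids the expensive step whenever the file was written at feature extraction time.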
From kaldi-help, "Purpose and use of utt2dur":
@kkm000:
@danpovey:
@kkm000: