YuanGongND / gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
BSD 3-Clause "New" or "Revised" License
148 stars, 27 forks

Can't run inference on my own data; the result belongs to the speechocean762 dataset #24

Closed: JustVinh closed this issue 1 year ago

JustVinh commented 1 year ago

How did you run inference on your own wav file? I followed the inference instructions, but the result I got is [u1, u2, u3, u4, u5, p, w1, w2, w3] for the speechocean762 dataset, not for my wav file :( (I realized this because the tensor shape of u1 is (2500, 1), while I only have 1 wav file.)

Also, the inference instructions don't mention where to put your own dataset folder, or where in the code to specify the path to your own dataset.
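For reference, the shape check I used to spot the mismatch can be sketched roughly like this. It is only an illustration: `check_utterance_count` is a hypothetical helper I made up, not something from the gopt repo, and it assumes the first dimension of each output tensor indexes utterances.

```python
def check_utterance_count(pred_shape, num_wav_files):
    """Return True if the leading dimension of a prediction's shape
    equals the number of wav files that were supposed to be scored.

    Assumption (not confirmed by the repo): dimension 0 of the output
    indexes utterances, so one wav file should give a leading dimension of 1.
    """
    num_predicted = pred_shape[0]
    return num_predicted == num_wav_files


# Shape reported above for u1, compared against my single wav file.
u1_shape = (2500, 1)
print(check_utterance_count(u1_shape, 1))  # False
```

A `False` here suggests the model was fed features from some other dataset rather than from my file.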