YuanGongND / gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
BSD 3-Clause "New" or "Revised" License
143 stars 25 forks source link

How to improve speed for GOPT #36

Open vinh22032000 opened 5 months ago

vinh22032000 commented 5 months ago

Hello everyone, I am working on a project where I need to use GOPT to evaluate a voice record in real-time. I am using Yuan Gong tutorial to process each speak sentence but it is quite slow. I guess it is slow due to the overhead of loading the kadi model to extract the Gop feats. Any solution to this? I can only think of using PyKaldi to cache the model in the memory. Thanks

gsabarinath02 commented 5 months ago

Hey,

Are you able to achieve this? I'm also working on a similar project to evaluate voice recordings in real-time. Could you tell me how you achieved this? I've encountered some errors, like "utt2spk" and "spk2utt" not matching. If you're willing, we can collaborate and develop a robust system where people can assist each other in recording their voices in real-time and evaluating them.

If anyone is interested in joining this effort, we can work together and accomplish it as soon as possible. I don't need any credits since I'm pursuing this for study purposes and out of passion. Anyone can take credit if desired.

Let me know if you're interested in collaborating!