Open lyapple2008 opened 5 years ago
Hi, is there any silence at the beginning of your sample? If not, the result may not be good. ACAM is a context-based model, so it needs some leading samples to capture the speech context. Please send your sample to jtkim@kaist.ac.kr and I'll debug it for you.
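If the recording starts directly with speech, one possible workaround is to pad the input with leading silence before running detection. The snippet below is only a minimal sketch of that idea, not part of this repository: the function name `prepend_silence`, the 0.5 s default, and the 16 kHz rate in the usage lines are assumptions for illustration, and the amount of context ACAM actually needs is not stated in this thread.

```python
def prepend_silence(samples, sample_rate, silence_sec=0.5):
    """Return a new list with `silence_sec` seconds of zeros before `samples`.

    `samples` is a sequence of PCM sample values (hypothetical input format);
    `silence_sec` is an assumed default, not a value recommended by the author.
    """
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    if silence_sec < 0:
        raise ValueError("silence_sec must be non-negative")
    # Number of zero-valued samples needed for the requested duration.
    n_pad = int(round(sample_rate * silence_sec))
    return [0] * n_pad + list(samples)


# Usage: pad a short signal sampled at an assumed 16 kHz.
padded = prepend_silence([1, 2, 3], sample_rate=16000, silence_sec=0.5)
```

Remember to shift the detected speech timestamps back by the padded duration afterwards, since the labels will be offset by the added silence.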
Thank you for your reply. I have sent the test audio to your email.
As the title says, I found that the beginning of the corpus is always detected as non-speech. Can you explain why?