Closed arnavmehta7 closed 1 year ago
What does "lexicon.txt" look like? Can you share any example textgrids of the misalignments, or files that I can try to replicate and see what's going on?
Is there any way to improve accuracy? I found that the alignments are not accurate when there are only a few files. I have tried installing different versions, such as 1.0, 2.2 and 3.0, but across 10 tests the accuracy was not as good as the results I got on Windows before, and some of the results are very different.
@fangg2000 can you create a new issue? I'm going to close this one as stale. If possible, please include the specifics of the runs you've done, since there are multiple possible reasons why you're not seeing the same accuracy.
Debugging checklist
[x] Have you updated to latest MFA version?
[x] Have you tried rerunning the command with the --clean flag?
Describe the issue
The accuracy of the timestamps is too low: they are mostly off by about 1 second on short audio files and by about 6 seconds on long ones.
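To make a report like this easier to reproduce, it can help to put a number on the offset. The following is a minimal sketch, not part of MFA: the function names `boundary_offsets` and `mean_abs_offset` are hypothetical, the intervals are made-up illustration data, and it assumes you have already extracted (start, end, label) word intervals from a reference alignment and from the MFA output TextGrid by some other means.

```python
from statistics import mean
from typing import List, Tuple

# (start_sec, end_sec, label) for one word interval
Interval = Tuple[float, float, str]


def boundary_offsets(
    reference: List[Interval], aligned: List[Interval]
) -> List[Tuple[str, float, float]]:
    """Return (label, start_offset, end_offset) per word, in seconds.

    A positive offset means the aligned boundary is later than the reference.
    Assumes both lists contain the same words in the same order.
    """
    if len(reference) != len(aligned):
        raise ValueError("reference and aligned must have the same length")
    offsets = []
    for (r_start, r_end, r_label), (a_start, a_end, a_label) in zip(reference, aligned):
        if r_label != a_label:
            raise ValueError(f"label mismatch: {r_label!r} vs {a_label!r}")
        offsets.append((r_label, a_start - r_start, a_end - r_end))
    return offsets


def mean_abs_offset(offsets: List[Tuple[str, float, float]]) -> float:
    """Mean absolute boundary error over all start and end boundaries."""
    return mean(abs(d) for _, start_d, end_d in offsets for d in (start_d, end_d))


if __name__ == "__main__":
    # Made-up example data: every aligned boundary is shifted by one second.
    reference = [(0.0, 0.5, "hello"), (0.5, 1.0, "world")]
    aligned = [(1.0, 1.5, "hello"), (1.5, 2.0, "world")]
    offsets = boundary_offsets(reference, aligned)
    for label, start_d, end_d in offsets:
        print(f"{label}: start {start_d:+.2f}s, end {end_d:+.2f}s")
    print("mean absolute offset:", mean_abs_offset(offsets))
```

A roughly constant offset across all words would suggest a different kind of problem than an error that grows over the course of a file, so the per-word values are worth including alongside the mean.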
For Reproducing your issue
Download the original LibriSpeech dataset and run:
mfa align inputs lexicon.txt english_mfa outputs
Corpus structure
Dictionary
Acoustic model
Log file
Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA).
No error
Desktop (please complete the following information):