Open lkraav opened 7 years ago
Yes, this needs to be clarified in the README.
Just to let you know, it only changes the names of the speakers in the output trs files, and the recognized speakers is a closed set of Estonian public figures who occur often enough in Estonian broadcast news (see for example the names in http://bark.phon.ioc.ee/tsab/p/play?trans=8840). It's not possible to change it or retrain it any way (currently). So it probably only interest you if you process Estonian broadcast news.
Hehe, yeah this was useful information. Well, I'm running it on my custom audio, let's see what it comes up with. A retraining process would definitely be useful.
Perhaps the README could clarify what the expected process output is when speaker ID feature is enabled? What is supposed to look different in the text output compared to disabling speaker ID. Is it possible to give speakers names via some transcription configuration file, or is that post-text-editing work?