Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
3.66k
stars
425
forks
source link
Improve speaker recognition accuracy #1526
Open
thewh1teagle opened 1 week ago
The current speaker recognition is not accurate comparing to pyannote-rs See https://github.com/thewh1teagle/loud.cpp/issues/12