k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
https://k2-fsa.github.io/sherpa/onnx/index.html
Apache License 2.0
3.66k stars, 425 forks

Improve speaker recognition accuracy #1526

Open thewh1teagle opened 1 week ago

thewh1teagle commented 1 week ago

The current speaker recognition is less accurate than pyannote-rs. See https://github.com/thewh1teagle/loud.cpp/issues/12