k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
https://k2-fsa.github.io/sherpa/onnx/index.html
Apache License 2.0
3.61k stars 422 forks source link

Speak Diarization with ASR #1487

Closed CSLukkun closed 2 weeks ago

CSLukkun commented 2 weeks ago

你好,想请教一个问题。

我希望将一个音频中的说话人进行分离。每一段切分的子音频 有开始时间,持续时间,说话的内容,以及说话人的id。 我不清楚在这个仓库中是否提供这样的功能。

感谢

csukuangfj commented 2 weeks ago

支持的

CSLukkun commented 2 weeks ago

您能提供一些实例 或者 指导给我吗?非常感谢

On Oct 28, 2024, at 18:52, Fangjun Kuang @.***> wrote:

支持的

— Reply to this email directly, view it on GitHubhttps://github.com/k2-fsa/sherpa-onnx/issues/1487#issuecomment-2441245279, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ARB63VNIKF2W4VQ44GKXOV3Z5YJO7AVCNFSM6AAAAABQXAXW32VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINBRGI2DKMRXHE. You are receiving this because you authored the thread.Message ID: @.***>

csukuangfj commented 2 weeks ago

请自己看我们的提供的文档。

我们有提供 speaker diarization的例子

有提供 asr 的例子

你自己组合起来,就是你想要的。