Supported functions
Speech recognition |
Speech synthesis |
Speaker verification |
Speaker identification |
✔️ |
✔️ |
✔️ |
✔️ |
Spoken Language identification |
Audio tagging |
Voice activity detection |
Keyword spotting |
✔️ |
✔️ |
✔️ |
✔️ |
Supported platforms
Architecture |
Android |
iOS |
Windows |
macOS |
linux |
x64 |
✔️ |
|
✔️ |
✔️ |
✔️ |
x86 |
✔️ |
|
✔️ |
|
|
arm64 |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
arm32 |
✔️ |
|
|
|
✔️ |
riscv64 |
|
|
|
|
✔️ |
Supported programming languages
C++ |
C |
Python |
C# |
Java |
JavaScript |
Kotlin |
Swift |
Go |
Dart |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
✔️ |
It also supports WebAssembly.
Introduction
This repository supports running the following functions locally
- Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
- Text-to-speech (i.e., TTS)
- Speaker identification
- Speaker verification
- Spoken language identification
- Audio tagging
- VAD (e.g., silero-vad)
- Keyword spotting
on the following platforms and operating systems:
with the following APIs
- C++, C, Python, Go,
C#
- Java, Kotlin, JavaScript
- Swift
- Dart
Links for pre-built Android APKs
Links for pre-built Flutter APPs
Description |
URL |
中国用户 |
Streaming speech recognition |
Address |
点此 |
Links for pre-trained models
Useful links
How to reach us
Please see
https://k2-fsa.github.io/sherpa/social-groups.html
for 新一代 Kaldi 微信交流群 and QQ 交流群.