-
I can't even download TTS.
https://github.com/k2-fsa/sherpa-onnx/releases --Only ASR ! Where's TTS?
-
I'm looking for an alternative to verify users by their voice.
Do you provide sample code for that?
Thanks and congratulations on your project
-
All data and config: [test_project1-G3-lean.zip](https://github.com/ALIZE-Speaker-Recognition/LIA_RAL/files/4509265/test_project1-G3-lean.zip)
I have made a very simple test with **3 speakers** in …
-
Try improve:
* Word recognition accuracy
* Speaker diarisation/identification accuracy
Things to try:
1. Remove noise from audio clip
2. Research Google Speech to Text API - and how to improve …
-
I get a lot of "Speaker?" in the final file and i do not know how to improve this.
Maybe you can give a few tips how to work with the pipeline.
-
"Is there any way we could do Tech & Check Alerts on Pod Save America?"
To answer this we have to look into whether or not there are transcripts associated with the podcast; if not, we can explore …
-
#### Overview
We propose to implement audio-visual calls and screen sharing within our platform's channels using the WebRTC technology facilitated by the PeerJS client/server framework. This feature w…
-
Thanks for this repo.Excellent recognition results!
Looking forward to open source code for Android platform deployment.
-
![image](https://github.com/LIN-SHANG/InstructERC/assets/54592017/8f96b0d3-c687-4051-a0b3-73281d196139)
您好,从代码看好像不能同时设定emotion_prediction='True' 和speaker_task='True',实际使用时也会报错找不到data目录。
qftie updated
6 months ago
-
Hello.
First of all very big thank you for this project.
I am trying to create an example with a speaker model
to get the X-vector of the speaker (voice fingerprint).
I am using this example: …