-
For app users, an easy-to-use (and GDPR-compliant) speech-to-text function would be helpful.
-
Sometimes there are multiple entries for the same phrase. There should be an interface to merge them, and a script to automatically merge entries whose transcription and translation are the same.
(…
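A minimal sketch of the automatic merge requested above. The `Entry` fields (`entry_id`, `transcription`, `translation`, `audio_ids`) are hypothetical, since the real schema is not shown here. Treating entries as "the same" after trimming whitespace and ignoring case is also an assumption; swap `normalize` for an identity function if only exact matches should merge.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class Entry:
    """Hypothetical phrase entry; field names are assumptions, not the app's schema."""
    entry_id: int
    transcription: str
    translation: str
    audio_ids: List[int] = field(default_factory=list)


def normalize(text: str) -> str:
    """Collapse runs of whitespace and ignore case (an assumed notion of 'the same')."""
    return " ".join(text.split()).casefold()


def merge_duplicates(entries: Iterable[Entry]) -> List[Entry]:
    """Merge entries whose transcription and translation both match.

    The first entry seen for each (transcription, translation) pair is kept,
    and the audio ids of later duplicates are appended to it.
    """
    merged: Dict[Tuple[str, str], Entry] = {}
    for entry in entries:
        key = (normalize(entry.transcription), normalize(entry.translation))
        kept = merged.get(key)
        if kept is None:
            # Copy the entry so the caller's objects are not modified.
            merged[key] = Entry(
                entry.entry_id,
                entry.transcription,
                entry.translation,
                list(entry.audio_ids),
            )
            continue
        for audio_id in entry.audio_ids:
            if audio_id not in kept.audio_ids:
                kept.audio_ids.append(audio_id)
    # Dicts preserve insertion order, so the output follows first appearance.
    return list(merged.values())
```

Entries that share a transcription but differ in translation stay separate, so those cases would still go through the manual merge interface.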
-
Does the speech pre-training consider the speech-to-text task? Or is the model being trained for speaker verification?
-
This new model seems suitable for integration: https://github.com/edwko/OuteTTS
We should add a minimal example for generating audio with it. Ideally, we will implement the (audio tokens)…
-
Add the full party name everywhere speech data is displayed. When the user clicks on a speech, the full party name should be shown instead of just the abbreviation.
NO HURRY.
-
Could you add native speech-to-speech / audio-to-audio support with an encoder (tokenizer) and a decoder (back to audio waveforms)?
I was able to implement a decoder-only model. I first used an audio codec to…
-
bash ./run_server_2pass_ssl.sh
I20241122 11:17:54.294150 43319 funasr-wss-server-2pass.cpp:21] model-dir : damo/speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404-onnx
I20241122 1…
-
### Before reporting, I have confirmed that
- [X] This bug does not appear to be reported on [GitHub Issues](https://github.com/Minecraft-Transit-Railway/Minecraft-Transit-Railway/issues?q=is%3Aiss…
-
The error is shown below. Any help would be appreciated.
2024-11-04 16:40:25.285 | INFO | app.services.voice:azure_tts_v1:1057 - start, voice name: zh-CN-XiaoyiNeural, try: 1
2024-11-04 16:40:35.693 | ERROR | app.services.voice:azure_…
-
I'm trying to use Vosk to recognize Korean speech,
but I still haven't found a good way to do it. Is there a good approach or website for this?