-
# [Paper Review] Speech Model Pre-training for End-to-End Spoken Language Understanding - Engineering Blog
Speech Model Pre-training for End-to-End Spoken Language Understanding
[https://hyungjung-l…
-
Hello, this is amazing.
I want to ask is it can be trained in other languages, or even if can be trained in multiple languages at the same time.
-
**Is your feature request related to a problem? Please describe.**
As I only speak English, I cannot really use the cosyvoice gradio interface.
**Describe the solution you'd like**
Please conside…
-
I have a NLP model that can predict if someone is in danger based on what they say(speech is converted to text then text is being analysed). However, it currently only works with English text. I'd lik…
-
Hello,
When installed on a local Ubuntu WSL under windows 11,
It currently works fine with --cpu_mode but throws "RuntimeError: CUDA failed with error out of memory" in standard mode. Unable to us…
-
### Steps to reproduce
1. Click on microphone symbol in keyboard to open STT
### Expected behaviour
Indicate that keyboard starts a STT (speech to text) service. Make input of returned string t…
-
### Describe the bug
Sometimes the speech pauses then the speaker continues but it's neither written nor is it any language, but it's clearly the same speaker. Unless you want to create a horror mo…
-
### Feature request
Maybe I'm just overlooking it, but it would rock if it were possible to do TTS for more languages. English is well catered for with `T5`, but for other languages I have to fall …
-
/kind feature
**Describe the solution you'd like**
Hope add [https://github.com/xorbitsai/inference](https://github.com/xorbitsai/inference) as the kserve huggingface LLMs serving runtime
Xor…
-
Hi,
I’m currently using RealtimeSTT with the following configuration:
```
recorder_config = {
'spinner': False,
'model': 'large-v2',
'language': 'en',
'silero_sensitivity': …