-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
from RealtimeSTT import AudioToTextRecorder
import pyperclip
def process_text(text):
pyperclip.copy(text)
if __name__ == '__main__':
print("Wait until it says 'speak now'")
r…
-
## **Summary**
First of all, thank you for the outstanding work on this mod. It's impressive, and I appreciate the effort and dedication that has gone into its development.
Let's jump straight t…
-
# Code
**model_config = outetts.HFModelConfig_v1(model_path=r"D:\model\tts\OuteAI\OuteTTS-0.1-350M", language="en", wavtokenizer_model_path=r"D:\model\tts\\OuteAI\wavtokenizer_large_speech_320_24k.c…
-
Hello! Could you please add SALMONN series models?
Title | Venue | Date | Code | Demo
-- | -- | -- | -- | --
[SALMONN: Towards Generic Hearing Abilities for Large Language Models](https://arxiv.o…
-
Does this issue occur when all extensions are disabled?: Yes/No
- VS Code Version:
- OS Version:
## Bugreport: "VS Code Speech" Extension Crashes on Activation
### Desc…
-
4080显卡,速度可能不到原来的1%,堪比用CPU跑。但看显卡占用又跑满了,找不到原因。是否没有正确调用到打包里的PyTorch和TensorFlow所致?
fasterwhispergui.log如下:
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and …
-
Hi, I am running faster-whisper on an audio file like follows
```
segments, info = model.transcribe(wav, task="transcribe", language="hi",beam_size=1, word_timestamps=True,max_new_tokens=50 )
```…
-
I have combined the phoneme sets for all three langauges,
English, Chinese, Japanese and started fine tuning using a datset comprised of all three speech languages
The base model I use is the chine…
-
I'm using Azure Speech to Text SDK version 1.21 in Golang. Here I'm adding a feature of Auto-detect language for the audio file using
```Go
engLangConfing, err := speech.NewSourceLanguageConf…