-
I am getting the following error when using the `whisper` engine with `align`.
```console
2024-07-31 13:00:06.2345858 [E:onnxruntime:, sequential_executor.cc:516 onnxruntime::ExecuteKernel] Non-ze…
cvl01 updated
1 month ago
-
@Niketkumardheeryan I would like to add a lip read model(LipNet) using CV
This project aims to build a lip reading model that takes a video of speech without audio and produces the output as text t…
-
- Speech SDK log taken from a run that exhibits the reported issue.
Check here [https://gist.github.com/Elshaffei/cb1f13f1d79ccd6df0641b864420bc93](url)
- A stripped down, simplified version of y…
-
It'd be useful to have the same ability to record recognized noises, the same as we have for speech.
This could apply to the built-in recognizers and/or the parrot integration.
- Just as with sp…
-
As more models and better APIs become available, we are all going to need a config management interface to set the various components in Sunva backend and pipeline systems.
References:
https://gis…
-
For unclear reasons the voice-based lipsync does not work with Lucas-Arts speech style.
I cannot imagine the reason, as the only difference are character animation and the way text is displayed, ne…
-
Promised a long time ago. Now that we have translations and auto-voice implemented, we can implement interactive speech bubble editing support.
http://lemmasoft.renai.us/forums/viewtopic.php?f=4&t=10…
-
# Project Description
## Project Overview
Develop a _Speech to Text conversion_ system using **Azure AI Speech Studio** and Azure Cognitive Services. Leverage Azure's speech recognition services to …
-
Simple code:
`import speech_recognition as sr
r = sr.Recognizer()
with sr.Microphone() as source:
print("Say something!")
audio = r.listen(source)
print(r.recognize_google(audio))`…
-
Exception in thread Thread-2 (process_tts):
Traceback (most recent call last):
File "E:\metahuman-stream\venv\lib\threading.py", line 1016, in _bootstrap_inner
self.run()
File "E:\metahuma…