-
As promised, here's the thread I'm making for this.
RE: pre-processing:
In ```pywhispercpp/model.py``` we have ```transcribe``` and it can take a numpy ndarray. What I was thinking is, rather th…
-
It's so appealing for me to use [streaming](https://github.com/huggingface/parler-tts/blob/8e465f1b5fcd223478e07175cb40494d19ffbe17/INFERENCE.md?plain=1#L158).
```
for (sampling_rate, audio_chunk)…
-
Add the ability for users to provide audio input for their performance reviews and self-reviews. Previously, users had to type their input, but now they can record their audio. You can use streamlit a…
-
**IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:**
- Speech SDK log taken from a run that exhibits the reported issue.
[azure_speeck_sdk.zip](https://github.com/user-attachments/files/1662…
-
Why does the input text, whether in English or Chinese, appear as garbled text during processing, ultimately leading to the error message 'Segmentation fault (core dumped)' and failing to generate an …
-
I am trying to generate voice but the generated output is strange. it has unwanted sentences in the generated audio. why is that?
this does not happen everytime through. like in the code, the 2nd a…
-
# Overview
To enhance the feature set of our [ai-network](https://docs.livepeer.org/ai/pipelines/overview#models-on-the-ai-subnet/), we aim to implement a `text-to-audio` pipeline using the [Stable…
-
The audio file is corrupted at the end, so an error is expected during decode process. However, PyAV stop processing while whisper using ffmpeg process the file until the corrupted are is detected.
…
-
when use new version, something occurred to me:
1. load model
cannot use
```py
chat.load(source="custom", custom_path=MODEL_PATH, device='cpu', compile=False)
```
and meet the question…
-
**Is your feature request related to a problem? Please describe.**
BBC's Learning English podcasts are not properly processed
**Describe the solution you'd like**
improve the "podcast parsing" fe…