audio-to-text Search Results

1000+ results for audio-to-text

abdeladim-s/pywhispercpp #49

Performance Improvement ideas / feature requests

As promised, here's the thread I'm making for this. RE: pre-processing: In ```pywhispercpp/model.py``` we have ```transcribe``` and it can take a numpy ndarray. What I was thinking is, rather th…

UsernamesLame updated 3 hours ago
94 comments
huggingface/parler-tts #114

Stream and play

It's so appealing for me to use [streaming](https://github.com/huggingface/parler-tts/blob/8e465f1b5fcd223478e07175cb40494d19ffbe17/INFERENCE.md?plain=1#L158). ``` for (sampling_rate, audio_chunk)…

RobinWitch updated 1 week ago
4 comments
ajitesh123/Perf-Review-AI #52

Audio support from issue

Add the ability for users to provide audio input for their performance reviews and self-reviews. Previously, users had to type their input, but now they can record their audio. You can use streamlit a…

ajitesh123 updated 2 months ago
1 comment
Azure-Samples/cognitive-services-speech-sdk #2542

Text cached without audio chunks return when doing text stre…

**IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:** - Speech SDK log taken from a run that exhibits the reported issue. [azure_speeck_sdk.zip](https://github.com/user-attachments/files/1662…

steven8274 updated 3 days ago
3 comments
2noise/ChatTTS #659

IMPORTANT: Why Does Input Text Appear Garbled, Leading to 'S…

Why does the input text, whether in English or Chinese, appear as garbled text during processing, ultimately leading to the error message 'Segmentation fault (core dumped)' and failing to generate an …

ckgithub2019 updated 1 month ago
3 comments
2noise/ChatTTS #697

Strange output of audio (generated audio does not match inpu…

I am trying to generate voice, but the generated output is strange. It has unwanted sentences in the generated audio. Why is that? This does not happen every time, though. Like in the code, the 2nd a…

samiulextreem updated 2 weeks ago
1 comment
livepeer/bounties #52

Stable Audio Pipeline implementation Bounty [$850]

# Overview To enhance the feature set of our [ai-network](https://docs.livepeer.org/ai/pipelines/overview#models-on-the-ai-subnet/), we aim to implement a `text-to-audio` pipeline using the [Stable…

JJassonn69 updated 1 week ago
2 comments
SYSTRAN/faster-whisper #988

faster-whisper vs whisper: PyAV stops during decode, ffmpeg …

The audio file is corrupted at the end, so an error is expected during the decode process. However, PyAV stops processing, while whisper, using ffmpeg, processes the file until the corrupted area is detected. …

rodrigofvale updated 3 days ago
2 comments
2noise/ChatTTS #704

new version load model and voice pt model not work

When using the new version, I ran into the following: 1. The model cannot be loaded with ```py chat.load(source="custom", custom_path=MODEL_PATH, device='cpu', compile=False) ``` and meet the question…

LivinLuo1993 updated 1 week ago
3 comments
yanus171/Handy-News-Reader #942

podcast processing - BBC learning english

**Is your feature request related to a problem? Please describe.** BBC's Learning English podcasts are not properly processed **Describe the solution you'd like** improve the "podcast parsing" fe…

gautxori-yuyu updated 2 weeks ago
2 comments
