text-to-audio Search Results

1000+ results
for text-to-audio

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

cbmflushing/transcription #1

explore self-hosted transcription solutions

Goal is to have transcription be done for .mp3 file into text file. We want to see what transcription solution is best so that the process can be added into our automation chain The idea is to integ…

a-chen updated 2 months ago
5
suno-ai/bark #527

max num tokens supported on inference is ~ 40 max. not 256 a…

max number of tokens I am able to run thru bark generate_text_semantic() is about 40, ~ 24 words or so. I looked thru the code and noticed that generate_text_semantic() clips anything over 256 and…

xvdp updated 8 months ago
2
Stability-AI/stable-audio-tools #45

PicklingError: Can't pickle <function get_custom_metadata at…

Hey, if i want to train my model with costum audios and promts via metadata i just get this traceback: > PicklingError: Can't pickle : import of module 'metadata_module' failed Traceback (most re…

TheZaind updated 5 months ago
3
Azure-Samples/cognitive-services-speech-sdk #2510

TTS: Excessive silence at the end of audio generated using g…

**Describe the bug** Audios generated for `gu-IN` locale using voice `gu-IN-DhwaniNeural` contains about 3 sec silence at the end of audio file. The same generation, performed using `gu-IN-NiranjanNe…

luzhanov updated 3 months ago
1
genesis-ai-dev/codex-editor #41

Enable multiple options for chunking text

It would be ideal to enable the user to convert the current draft files from "one chapter per cell" to "one verse per cell", "interlinear", "one pericope per cell", "one book per cell", etc. We need …

ryderwishart updated 2 months ago
1
huggingface/community-events #197

Increasing WER & Validation Loss During Whisper Fine-Tuning

Hi, I've recently created a dataset using speech-to-text APIs on custom documents. The dataset consists of 1,000 audio samples, with 700 designated for training and 300 for testing. In total, this eq…

monk1337 updated 1 month ago
2
Zulko/moviepy #1776

Does swapping text mid video work? Cutout/subclip fail on Te…

I cannot get text swapping to work during video composition, I have tried both cutout(l1, l2) and subclip(l1, l2) with no success as seen below. My output is a video with all 3 texts laying over eacho…

ffffffffffffhhhhhhhhhhhh updated 2 years ago
1
C0untFloyd/bark-gui #102

TypeError: Audio.__init__() got an unexpected keyword argume…

Loading text model from ./models\text_2.pt to cuda Loading coarse model from ./models\coarse_2.pt to cuda Loading fine model from ./models\fine_2.pt to cuda Launching Bark UI Enhanced v0.7.4 Server…

web2299 updated 7 months ago
1
clamsproject/mmif #231

proposing subtypes of `TextDocument`

### New Feature Summary With a number of recent development, I'd like to propose more vocab types that are subcategories of `TextDocument` (all names are tentative in the proposal) - `Transcript`:…

keighrim updated 4 months ago
10
schreibfaul1/ESP32-audioI2S #916

Metadata encoding

First of all, I have to thank you for the great library, without which I can't imagine working on my project. I'm using the stable version 3.0.12. I've encountered one problem with ID 3 tags of MP3 …

Pako2 updated 3 days ago
9

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for text-to-audio

1000+ results
for text-to-audio