-
```
path = r"D:\Project\Python_Project\FasterWhisper\large-v3"
model = WhisperModel(model_size_or_path=path, device="cuda", local_files_only=True)
segments, info = model.transcribe("audio.wav",…
-
The video I am downloading has multiple Audio options, specifically Portuguese, English, and Spanish, and there isnt an option to choose which audio to use so its defaulting to Spanish, when I want it…
-
#### BUILD AN APP ####
an AI powered assistant that takes a USERINPUT which could be:
1. Recorded Audio (via an in app or on device record-no feature OR upload audio file)
2. 2. Manual text ent…
-
HI,
Thank you for your amazing work.
when I try with multilingual audio files then I get the below error,
Please advise, where can I set the language config here
![image](https://github.com/Va…
-
I figured out that the new relevant url seems to be in the form of `https://www.crunchyroll.com/content/v2/cms/series/GRGG9798R/seasons?force_locale=&preferred_audio_language=ja-JP&locale=de-DE` (in c…
Yavos updated
2 weeks ago
-
### Checks
- [X] This template is only for question, not feature requests or bug reports.
- [X] I have thoroughly reviewed the project documentation and read the related paper(s).
- [X] I have search…
-
I’m encountering challenges with fine-tuning XTTS for Moroccan Darija despite having a high-quality extensive dataset.
Dataset: 1,000 hours of Moroccan Darija audio, segmented into 1–5 second clips, …
-
The logs of the Multimodal agent with OpenAI Realtime API show correct English text on the console but on the UI, it sometimes shows the audio transcription in other languages like Hindi, Chinese, Rus…
-
Thank you for the paper and code.
Voice cloning is excellent, however, for text to speech I'm experiencing word errors. Here is an example
[Prompt audio file](https://github.com/user-attachments…
-
when I use
```
echogarden align-transcript-and-translation 01.mp3 01.txt 01_translate.txt 01.json 01.srt
```
I got
```
Echogarden v1.5.0
Start stage 1: Align speech to transcript
Transcode…