speech-generation Search Results

1000+ results
for speech-generation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/RAG_Hack #116

VidSage: Video Insights using Graph RAG

### Project Name VidSage ### Description # VidSage: Video Insights using Graph RAG https://www.youtube.com/watch?v=IUSCWtB9jWk VidSage focuses on processing video data, storing it in Azur…

MayankKeshariC5 updated 3 weeks ago
1
dnhkng/GlaDOS #47

[feature] Ability to use AnyGPT for speech/text/image/music …

AnyGPT is quite a promising project released 2 months before GPT4o. It is a versatile multimodal *LLaMA-based* model, which is able not only to take images as an input, but also non-transcribed spe…

kabachuha updated 6 months ago
2
plussub/plussub #77

Convert subtitles to speech with tts reader to add speech sy…

Hello, thank you very much for the absolutely awesome fantastic wonderful great our beloved Plussub ! 🥇 💯 Please we have dream : we can download anime in Japanese from streaming sites which have 7…

trimechee updated 1 month ago
2
microsoft/RAG_Hack #160

Project: Interactive Learning Platform

### Project Name Curio ### Description ## ✨Curio Curio is a personalised learning platform which uses Retrieval-Augmented Generation (RAG) to generate interactive audio lessons that engage users i…

lilbandit updated 3 weeks ago
3
Azure-Samples/cognitive-services-speech-sdk #2510

TTS: Excessive silence at the end of audio generated using g…

**Describe the bug** Audios generated for `gu-IN` locale using voice `gu-IN-DhwaniNeural` contains about 3 sec silence at the end of audio file. The same generation, performed using `gu-IN-NiranjanNe…

luzhanov updated 3 months ago
1
shrimai/Focused-Attention-Improves-Document-Grounded-Generation #8

The requested URL /document_grounded_generation/cmu_dog/cmu_…

i cannot find the pretrained models is this link http://tts.speech.cs.cmu.edu/document_grounded_generation/cmu_dog/cmu_dog.zip

lalisaa updated 1 year ago
2
PantoMatrix/PantoMatrix #150

Custom Audio Inference issues /自定义音频推理及应用问题请教

Hello, thanks for your great work! I have encountered several problems during the reproduction process and would like to ask for advice: 1. I tried to generate actions using my own audio and used M…

Ancolie18 updated 1 month ago
10
dynamic-superb/dynamic-superb #7

[Task] Text-to-Speech Synthesis

# Text-to-Speech Synthesis Text-to-Speech is a speech generation task that converts written language into its spoken form. ## Task Objective Text-to-Speech Synthesis (TTS) is an essential ta…

kuan2jiu99 updated 1 year ago
1
neulab/ExplainaBoard #56

New task: speech recognition

Speech recognition is a standard generation task where the input is speech, output is text. For now, analysis could be done on the output side only. * Evaluation metric: word error rate, character …

neubig updated 2 years ago
2
Weilbyte/tiktok-tts #45

TTS API Requests

Hey Weilbyte, I have a friend who has a speech impairment, who doesn't like using discords tts as its a mans voice, I was wonder if you would be willing support an api where I can request text in t…

JoshoNZ updated 7 months ago
4

上一页 1...10 11 12 13 14 15 16...100 下一页

1000+ results for speech-generation

1000+ results
for speech-generation