speech-language-model Search Results

1000+ results
for speech-language-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

dynamic-superb/dynamic-superb #102

[Task] Speech Emotion Captioning

# Speech Emotion Captioning Speech emotion captioning is to describe the emotion in speech using natural language. ## Task Objective Compared with traditional speech emotion recognition(wher…

ddlBoJack updated 2 months ago
9
homebrewltd/ichigo #26

idea: Text-to-music brainstorm August 2024

Proprietary music generation is far ahead of open source (see Suno, Udio et al). Using your encodec method, please include text-to-music with English synthetic Singing somehow. I'm not sure of the…

bennmann updated 1 week ago
2
coqui-ai/TTS #3836

[Bug] HELP HELP HELP

### Describe the bug I M trying to use this repo for urdu language i have found some pretrained module on hugging face but i m unable to use i dont have any prior knowledge of python i m not fami…

paisekamao updated 2 weeks ago
3
mkiol/dsnote #158

Strange insertion of words not resembling what I spoke, even…

Thank you very much for this wonderful program, it has very high accuracy levels and is helping me so much in many ways :) But unfortunately Speech Note keeps inserting words that I didn't speak ra…

Getarhubar updated 1 week ago
6
dynamic-superb/dynamic-superb #148

[Task] Phone-Phoneme segment counting

# Phone/Phoneme segment counting This task is to count the number of phoneme segments in a given speech sample. This task is essential for evaluating the ability of models in the benchmark to accurat…

eunjung31 updated 3 months ago
3
myshell-ai/OpenVoice #311

v2 does not work well with cosyvoice TTS

1. use CosyVoice Chinese woman to generate audio (first video), then use OpenVoice ToneColorConverter to generate audio(third video) according target_se(second video) that has serious electrical tone…

xipingL updated 1 week ago
1
SYSTRAN/faster-whisper #230

enable vad_filter cause timestamp mismatch

I tested some videos if the silence duration is long , then enable vad_filter will be effective but if video is as normal, then enable vad_filter may cause more timestamp mismatch is there …

iorilu updated 3 weeks ago
7
sc0ty/subsync #85

Dictionaries and speech recognition models requests

This is aggregated issue to request support for new languages. If you see one of the following errors: > Synchronization between languages xxx - yyy is currently not supported. > Synchronization …

sc0ty updated 3 weeks ago
19
homebrewltd/research #33

idea: Research API to collect data for Model Training

We want to: - Offer free LLM api to community - Build up open dataset - Collecting feedback of research team model Probably will be hosted on cloud

tikikun updated 1 week ago
1
rtvi-ai/rtvi-web-demo #6

Unable to change TTS language - French language setting not …

Currently, there is no functional way to change the Text-to-Speech (TTS) language in our application. While the system is intended to support French ("fr") as a language option, this setting is not be…

ttamoud updated 1 month ago
2

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for speech-language-model

1000+ results
for speech-language-model