speech-representation Search Results

espeak-ng/espeak-ng #617

Generating speech from phonemes? / Phoneme representation sp…

Hello, can espeak-ng take IPA phonemes as input, or only its custom phoneme representation (e.g. espeak-ng "[[h@´loU]]")? Is there a specification or documentation of this phoneme representation or…

delthas updated 11 months ago

openvinotoolkit/openvino #26845

[Feature Request]: Support Moshi speech-text foundation and …

### Request Description Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, …

atlury updated 2 weeks ago

TensorSpeech/TensorFlowASR #49

SimCLR loss for self-supervised speech representation learni…

Hi, self-supervised pretraining for speech representation is a promising technique for developing ASR in resource-constrained languages with little transcribed data, and SimCLR is applied with success…

monatis updated 3 years ago

pytorch/pytorch #94855

torch.compile breaks reproducibility

### 🐛 Describe the bug Adding torch.compile does not ensure deterministic results after setting a seed (and ensuring all the steps here: https://pytorch.org/docs/stable/notes/randomness.html#:~:text=…

rahul-1996 updated 2 months ago

ankitapasad/layerwise-analysis #7

About the selection of glove

Hello, I would like to ask why the choice of glove embeddings is Common Crawl and the choice of agwe embeddings is librispeech in the code. Shouldn't the choice of glove embeddings also be librispeech

futian00 updated 3 weeks ago

charlesLoder/hebrew-transliteration #87

Using for tts

First of all, thanks for your great work. It's very interesting! I would like to use this package to prepare data which I'll use to train a Hebrew text-to-speech model. Can you tell me what's the best …

thewh1teagle updated 1 month ago

OMDoc/OMDoc #371

Representation of the part of speech of mathematical terms/s…

_migrated from Trac, where originally posted by **clange** on 7-Oct-2010 10:42am_ [The SlugMath semantic wiki for mathematical course notes](http://slugmath.ucsc.edu/mediawiki/index.php/Category:Lexi…

jbs1 updated 8 years ago

OpenPecha/Requests #361

RFW0122: Text-to-Speech (TTS) with Diverse Accents and Gende…

# RFW0122: Text-to-Speech (TTS) with Diverse Accents and Gender ## Summary The goal of this RFW is to expand our existing Text-to-Speech (TTS) to encompass a wider range of accents and genders …

lobsam updated 11 months ago

Lhx94As/PHO-LID #2

seq

Hi, I want to know how to set T'_i. I have extracted the speech representation.

whh07141 updated 7 months ago

dynamic-superb/dynamic-superb #41

[Task] Speech Command Recognition - AudioMNIST

# Task Name Spoken digit recognition - AudioMNIST ## Task Objective The task's objective is to classify audio samples of spoken digits (0-9) into their corresponding Arabic number representat…

dlion168 updated 3 months ago

1000+ results for speech-representation
