speech-representation Search Results

1000+ results
for speech-representation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

p0p4k/pflowtts_pytorch #14

descript/encodec is too slow in dataloader

Hi @p0p4k , I see that the process time of dac encode in `dev/descript_codec` branch is too slow on CPU in Dataloader. How can we speedup this process? ``` def batched_encodec(self, wav): …

vuong-ts updated 11 months ago
7
portabletext/portabletext #5

Nomenclature

"Style": "h1" Should not be "style", a heading is not a style, it's a semantic label for type of content in the text node. This property seems to have a confusing name. Style decoration (this shoul…

MattWilcox updated 4 years ago
3
AmmarLatef/typeandspeak #10

Add SSML support

``` Add SSML support for capable TTS engines. Need a list of SSML-capable engines. ``` Original issue reported on code.google.com by `web...@gmail.com` on 20 Apr 2012 at 6:19

GoogleCodeExporter updated 9 years ago
6
yudop/typeandspeak #10

Add SSML support

``` Add SSML support for capable TTS engines. Need a list of SSML-capable engines. ``` Original issue reported on code.google.com by `web...@gmail.com` on 20 Apr 2012 at 6:19

GoogleCodeExporter updated 9 years ago
6
presciencelabs/tabitha-sources #9

Move data from Sample.Features_Source into the sources proje…

formed from the pairing session in https://github.com/presciencelabs/tabitha-targets/issues/4#issuecomment-2125655542

longrunningprocess updated 3 months ago
5
anonymous-pits/pits #29

Trying ot use pitch predictor with different texts.

I know it is quite ackward to do this as the text encoder may produce latent variables which contains pitch, but I'm trying to use the reference linear spectrogram and posterior yingram encoder to gen…

ljh0412 updated 1 year ago
4
huggingface/distil-whisper #107

[Question] Can we distill for multiple langauges for distil-…

I have seen several distillations for different single languages for distil-whisper (like en, de etc). But I have yet to come across some distil-whisper which has been trained to be multilingual. For …

Killshot667 updated 6 months ago
3
tue-robotics-graveyard/yapykaldi #6

Grammar-guided recognition

Going from open-world to closed-world speech should improve the rate at which sentences can be parsed.

LoyVanBeek updated 4 years ago
20
digitallinguistics/scription #50

clarify utterance-level and word-level lines, and the relati…

Certain lines represent properties of an utterance (e.g. `\txn`, `\sp`), while other lines are properties of words (e.g. `\w`, `\wlt`) or morphemes (e.g. `\m`, `\gl`). Clarify which lines are of which…

dwhieb updated 1 year ago
1
yyf17/awesome-embodied-intelligent #1

SoundSpace

# [sound-spaces](https://github.com/facebookresearch/sound-spaces) [Project: RLR-Audio-Propagation](https://github.com/facebookresearch/rlr-audio-propagation) [Audio Sensor](https://github.com/f…

yyf17 updated 2 years ago
1

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for speech-representation

1000+ results
for speech-representation