-
### Describe the bug
Hi, I tried to reproduce the CVSS speech translation result, but I got a very high training loss (around 200) at around epoch 18.
I just followed the instructions …
-
Both models were trained with the latest code; there is no need to switch back to 0.0.1.
The first model, synthesizer-merged_110k, was trained jointly on the four datasets supported by the code (aidatatang_200zh, magicdata, aishell3, data_aishell), with learning rate = 0.001 (no decay), batch size = 128, 110k iterations.
The second model, synthesizer-z…
-
I tried setting this up on a fresh Linux install and found that the steps in the README are incomplete for setting it up from scratch. The steps mentioned are:
```
npm install
npm start
# source $VIR…
-
Hello,
Thank you for sharing this dataset.
Would it be possible to have more information on how the audio was generated? In particular, the names of the datasets the vocoders were trained on.
Thank yo…
-
I ran the aishell recipe with this [gst + xvector + tacotron2](https://github.com/espnet/espnet/blob/master/egs2/aishell3/tts1/conf/tuning/train_gst%2Bxvector_tacotron2.yaml) configuration. However, the clo…
-
Has anyone tried to use a different, more recent (and supposedly better) vocoder?
HiFi-GAN is already a bit old, and better options have appeared, like [BigVGAN](https://github.com/NVIDIA/BigVGAN) and maybe …
-
# NVIDIA NeMo (ByT5 G2P and G2P-Conformer):
> NVIDIA NeMo provides grapheme-to-phoneme models for various languages, including **German**.
> The ByT5 G2P model is based on a neural network and can…
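A quick illustration of the input representation the quoted description refers to: ByT5-style models consume raw UTF-8 bytes rather than a learned subword vocabulary, which is why they extend naturally to languages like German with characters such as "ß". The sketch below shows only the byte-to-token-id mapping (the +3 offset reserves ids 0–2 for the pad/eos/unk special tokens, per the ByT5 convention); it involves no NeMo code and is not a working G2P model.

```python
# ByT5 operates directly on UTF-8 bytes: each byte b maps to token id b + 3,
# reserving ids 0-2 for the pad/eos/unk special tokens. No German-specific
# vocabulary is needed, which suits G2P for words like "Strasse" with umlauts/eszett.
word = "Straße"
byte_ids = [b + 3 for b in word.encode("utf-8")]
# "ß" occupies two UTF-8 bytes (0xC3 0x9F), so 6 characters yield 7 token ids
print(byte_ids)
```

Because the vocabulary is just the 256 byte values plus a handful of specials, no tokenizer training or language-specific preprocessing is required.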
-
### 🚀 The feature
The [Modified Discrete Cosine Transform (MDCT)](https://en.wikipedia.org/wiki/Modified_discrete_cosine_transform) is a perfectly invertible transform that can be used for featur…
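To make the "perfectly invertible" claim concrete: with a window satisfying the Princen–Bradley condition (e.g. the sine window), MDCT analysis followed by windowed IMDCT and overlap-add of 50%-overlapping frames reconstructs the interior of the signal exactly, because the time-domain aliasing introduced by each frame cancels against its neighbors. A minimal NumPy sketch (frame length 2N, hop N; variable names are illustrative and not from any torchaudio API):

```python
import numpy as np

N = 64                                    # half the frame length; hop size
n = np.arange(2 * N)
k = np.arange(N)
# MDCT analysis matrix: N coefficients from 2N windowed samples
C = np.cos(np.pi / N * (n[None, :] + 0.5 + N / 2) * (k[:, None] + 0.5))
# Sine window: satisfies Princen-Bradley, w[n]^2 + w[n+N]^2 = 1
w = np.sin(np.pi / (2 * N) * (n + 0.5))

rng = np.random.default_rng(0)
x = rng.standard_normal(8 * N)            # test signal
xp = np.pad(x, N)                         # pad so edge frames overlap zeros
y = np.zeros_like(xp)
for m in range(len(xp) // N - 1):         # 50%-overlapping frames, hop N
    frame = xp[m * N:(m + 2) * N]
    X = C @ (w * frame)                                 # forward MDCT: 2N -> N
    y[m * N:(m + 2) * N] += (2.0 / N) * w * (C.T @ X)   # windowed IMDCT + overlap-add

print(np.allclose(y[N:-N], x))            # aliasing from adjacent frames cancels
```

Note the transform is 2x-overcomplete in time but critically sampled overall (N coefficients per N-sample hop), which is exactly what makes it attractive for feature extraction compared to a magnitude spectrogram, where phase must be discarded or estimated.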
-
### 🐛 Describe the bug
Description
I'm trying to process a dataset using the extract_features.py script in Python, which uses the NsfHifiGAN model to generate audio features. However, when I run…
-
In the script that extracts features for magphase, it says it typically extracts 60 mag, 45 real, and 45 imag features. I am using 48 kHz audio, just like in the script. So are those numbers correct th…