mel-gan Search Results - Githubissues

350 results
for mel-gan

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

GasimV/Commercial_Projects #2

Speech Processing Models

`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…

GasimV updated 4 months ago
16
espnet/espnet #5179

[TTS-JETS] about CTCLoss?

1. document of [torch.nn.CTCLoss](https://pytorch.org/docs/stable/generated/torch.nn.CTCLoss.html#torch.nn.CTCLoss) says: the input is a **logarithmized probabilities**, obtained with `torch.nn.funct…

BridgetteSong updated 1 year ago
6
voicepaw/so-vits-svc-fork #337

MPS Support

I've read some issues about mps of pytorch, it turns out that currently mps doesn't support complex types (like 1+2j). But I think svc requires complex types. One of the current solution is adding a.t…

escoolioinglesias updated 1 year ago
14
karchkha/MelSpec_VQVAE #1

Pretraining model？

Do you have a pre-training model? I want to save time on training. And what about your training hours with the epoch=100.

a897456 updated 8 months ago
15
as-ideas/ForwardTacotron #110

Cast error details: Unable to cast [Array] to Tensor

Hello. I downloaded the pretrained modal `ljspeech v3.1` and when I try to run `python gen_forward.py --alpha 1 --checkpoint pretrained-forward_step90k.pt --input_text 'this is whatever you want it to…

Tony-Starkus updated 4 months ago
9
zhangyongmao/VISinger2 #1

Will the code be open-sourced?

Great work! This singing synthesis effect is the best I have heard so far, and the paper is also very scientific. Will you release the code?

980202006 updated 1 year ago
32
Kahsolt/soft-vc-acoustic-models #1

关于databaker数据集的疑问

感谢开源！！最近在关注soft-vc，对于自有数据集建模有两个疑问： 1 自有数据集训练声学模型，用到的units来源，是来自于原作者预训练的hubert_soft吗？这个是基于英文数据集训练的，在做中文VC任务的时候，这个可以直接用于提取units吗？ 2 此项目开源的databaker音色的声码器是通用的还是在databaker数据集上finetune过？

superhg updated 1 year ago
2
facebookresearch/AudioDec #32

feature matching loss and adversarial loss rise steadily?

Hi authors, I am trying to train low-bit-rate codecs for 8kHz sample rate audio, specifically targeting bit rates of 3.2 kbps, 1.8 kbps, and 1.2 kbps. I set the hop_size to 240 (which factors as 2 …

Chengbin-Liang updated 1 month ago
7
NVIDIA/tacotron2 #280

reduction window is vital for the model to pick up alignment…

The hparams.py says `n_frames_per_step=1, # currently only 1 is supported`, but reduction window is very important for them model to pick up alignment. Using a reduction window can be considered as d…

bfs18 updated 1 year ago
89
moelgendy/deep_learning_for_vision_systems #31

Installation

I like the book but can't seem to install the software properly. That is a shame as it detracts from what is otherwise an excellent book! Many similar book suffer from this issue. I think more detail …

Mel65 updated 2 years ago
3

上一页 1...5 6 7 8 9 10 11...35 下一页

350 results for mel-gan

350 results
for mel-gan