-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
1. The documentation for [torch.nn.CTCLoss](https://pytorch.org/docs/stable/generated/torch.nn.CTCLoss.html#torch.nn.CTCLoss) says that the input is **logarithmized probabilities**, obtained with `torch.nn.funct…
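To make "logarithmized probabilities" concrete, here is a minimal standard-library sketch of a log-softmax over one frame of raw scores. The function name `log_softmax` and the sample scores are illustrative assumptions, not taken from the issue.

```python
import math

def log_softmax(logits):
    """Return log-probabilities for one frame of raw scores.

    Uses the log-sum-exp trick (subtract the max) for numerical stability.
    """
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

# Hypothetical raw scores for 4 classes at a single time step.
frame_logits = [2.0, 0.5, -1.0, 0.0]
log_probs = log_softmax(frame_logits)

# Exponentiating the log-probabilities recovers a distribution summing to 1.
total_prob = sum(math.exp(lp) for lp in log_probs)
```

In PyTorch the same operation is `torch.nn.functional.log_softmax(logits, dim=2)` on a `(T, N, C)` tensor, whose result can be passed to `torch.nn.CTCLoss`.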
-
I've read some issues about MPS in PyTorch, and it turns out that MPS currently doesn't support complex types (like 1+2j). But I think svc requires complex types. One of the current solutions is adding a.t…
-
Do you have a pretrained model? I want to save time on training.
Also, how many hours did training take with epoch=100?
-
Hello. I downloaded the pretrained model `ljspeech v3.1` and when I try to run `python gen_forward.py --alpha 1 --checkpoint pretrained-forward_step90k.pt --input_text 'this is whatever you want it to…
-
Great work! This singing synthesis effect is the best I have heard so far, and the paper is also very scientific. Will you release the code?
-
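Thanks for open-sourcing this! I have recently been following soft-vc and have two questions about modeling with my own dataset:
1. When training the acoustic model on my own dataset, do the units come from the original author's pretrained hubert_soft? That model was trained on an English dataset; can it be used directly to extract units for a Chinese VC task?
2. Is the vocoder released in this project for the databaker voice a universal one, or was it fine-tuned on the databaker dataset?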
-
Hi authors,
I am trying to train low-bit-rate codecs for 8kHz sample rate audio, specifically targeting bit rates of 3.2 kbps, 1.8 kbps, and 1.2 kbps. I set the hop_size to 240 (which factors as 2 …
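As a rough check on these settings, the sketch below works out how many bits each frame can carry at each target bit rate, given an 8 kHz sample rate and a hop size of 240. The helper name `bits_per_frame` is hypothetical, and the sketch assumes one code frame per hop with no side information.

```python
from fractions import Fraction

def bits_per_frame(bitrate_bps, sample_rate_hz, hop_size):
    """Bit budget per frame, assuming exactly one frame per hop."""
    frame_rate = Fraction(sample_rate_hz, hop_size)  # frames per second
    return Fraction(bitrate_bps) / frame_rate

SAMPLE_RATE = 8000  # Hz, from the question
HOP_SIZE = 240      # samples per frame, from the question

budgets = {
    bps: bits_per_frame(bps, SAMPLE_RATE, HOP_SIZE)
    for bps in (3200, 1800, 1200)
}

for bps, bits in budgets.items():
    print(f"{bps} bps -> {float(bits):g} bits per frame")
```

Under these assumptions the frame rate is 100/3 (about 33.3) frames per second, so the budgets come out to 96, 54, and 36 bits per frame respectively.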
-
The hparams.py says `n_frames_per_step=1,  # currently only 1 is supported`, but a reduction window is very important for the model to pick up alignment. Using a reduction window can be considered as d…
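For readers unfamiliar with the term, here is a minimal sketch of what a reduction window of `r` frames per step usually means in Tacotron-style models: consecutive spectrogram frames are grouped so the decoder predicts `r` frames per step and takes roughly `1/r` as many steps. The helper `group_frames` and the toy data are assumptions for illustration; nothing here is taken from this repository's code.

```python
def group_frames(frames, r):
    """Group consecutive frames into blocks of r frames per decoder step.

    frames: list of equal-length feature vectors (one per time step).
    The tail is padded with zero frames so the length is a multiple of r.
    """
    if r < 1:
        raise ValueError("r must be a positive integer")
    n_features = len(frames[0])
    padded = list(frames)
    while len(padded) % r != 0:
        padded.append([0.0] * n_features)
    # Concatenate each run of r frames into a single decoder target.
    return [sum(padded[i:i + r], []) for i in range(0, len(padded), r)]

# Toy "spectrogram": 7 frames with 3 features each (hypothetical values).
frames = [[float(t)] * 3 for t in range(7)]
steps = group_frames(frames, r=2)
```

With `r=2`, the 7 frames are padded to 8 and become 4 decoder steps of 6 values each, so the attention mechanism has half as many positions to align.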
-
I like the book but can't seem to install the software properly. That is a shame, as it detracts from what is otherwise an excellent book! Many similar books suffer from this issue. I think more detail …