-
I just tried to use and test your model, unfortunately I only have a GPU with 16GB of RAM. Apparently WavLM takes about 12 GB and HifiGAN needs another 5 GB so you need at least 20GB of RAM to run inf…
-
### System Info
- `transformers` version: 4.36.0.dev0
- Platform: Linux-5.15.120+-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.17.3
- Safetensors version: 0.4.0
-…
-
你好,我在使用`DiffuseStyleGesture+`进行推理时,写bvh出现了问题,涉及到的代码为[code](https://github.com/YoungSeng/DiffuseStyleGesture/blob/master/BEAT-TWH-main/process/pymo/writers.py#L57), 列表`self.motions_`前6个元组的形状为(num_frame…
-
Looks like, unlike FreeVC https://github.com/OlaWod/FreeVC/blob/81c169cdbfc97ff07ee2f501e9b88d543fc46126/data_utils.py#L72C9-L72C9 your code doesn't explicitly use this param, in fact it doens't use S…
-
I encountered a strange bug or rather a strange behaviour, which I can not really pinpoint to the exact issue.
I used the standard training, as you described and it worked fine. However when I change…
-
This is a really good project. I was wondering if WavLM is supported in the project, I wanted to run a voice conversation model in the browser, also if Hifi-gan for voice synthesis.
-
Share your Chinese synthesis results or mandrain model training questions.
-
I sometimes experience a bug when performing matching with big datasets (20k samples+).
This is the Stacktrace:
```
Feature has shape: torch.Size([445, 1024])----------------------------------…
-
音频是15s一个文件
格式是FormatFactoryPart1
文本预处理和生成bert都成功了,但是训练总是空跑,信息如下
加载config中的配置0
加载config中的配置localhost
加载config中的配置10086
加载config中的配置0
加载config中的配置1
加载环境变量
MASTER_ADDR: localhost,
MASTER_PORT…
-
Dear author, your article and code are very helpful to me, and I will also cite your article in my paper later. Could you please upload the data processing and training part about meld data set? I can…
MF-XU updated
7 months ago