-
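```
import torch

def generate_speaker_tensor(mean: float = 0.0, std: float = 15.247) -> torch.Tensor:
    return torch.normal(mean, std, size=(768,))

def generate_speaker_tensor_a() -> torch.Tensor:
    st…
```
craii updated
1 month ago
-
Please update the README!!!
Please update the code!!!
Some problems reported in the issues are marked as fixed, but after pulling the latest code I found the fixes have not been synced: for example, replacing the torchaudio library with soundfile, and the [] problem in infer_file. Please also document do_text_normalization=False.
Also, some commenters tell people to install conda. If you were not already using it to manage your Python environments, it is completely unnecessary!!!
Chinese works fine and is read out normally!!!!
…
-
The approach in detail:
**1. Generate rand_spk and save its weights to a CSV file:**
```
import torch
import csv
std, mean = torch.load("spk_stat.pt").chunk(2)
rand_spk = torch.randn(768) * std + mean
writeToCsv(f"saved.csv",rand_spk…
```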
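The save-and-restore idea in step 1 can be sketched without any dependencies. This is only an illustration under stated assumptions: it uses the standard-library `random` module in place of torch, the mean and std values are placeholders rather than the contents of `spk_stat.pt`, and the helper names `sample_speaker`, `save_vector_csv` and `load_vector_csv` are hypothetical (the post's own `writeToCsv` is not shown, so this is not its implementation).

```python
import csv
import os
import random
import tempfile

DIM = 768  # vector length used in the snippet above


def sample_speaker(mean, std, rng):
    """Draw x_i = z_i * std_i + mean_i with z_i ~ N(0, 1), one value per dimension."""
    return [rng.gauss(0.0, 1.0) * s + m for m, s in zip(mean, std)]


def save_vector_csv(path, vector):
    """Write the vector as a single CSV row; repr() keeps full float precision."""
    with open(path, "w", newline="") as f:
        csv.writer(f).writerow([repr(v) for v in vector])


def load_vector_csv(path):
    """Read the single CSV row back into a list of floats."""
    with open(path, newline="") as f:
        return [float(v) for v in next(csv.reader(f))]


# Placeholder statistics (assumption): real values would come from spk_stat.pt.
mean = [0.0] * DIM
std = [1.0] * DIM

rng = random.Random(0)  # fixed seed so the same "speaker" can be regenerated
spk = sample_speaker(mean, std, rng)

csv_path = os.path.join(tempfile.mkdtemp(), "saved.csv")
save_vector_csv(csv_path, spk)
restored = load_vector_csv(csv_path)
```

Because each float is written with `repr()`, the restored list compares equal to the sampled one, so a vector that produced a voice you like can be reloaded later instead of being re-sampled.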
-
-
Collecting deepspeed==0.12.4 (from resemble-enhance)
Using cached deepspeed-0.12.4.tar.gz (1.2 MB)
Preparing metadata (setup.py) ... error
error: subprocess-exited-with-error
× python se…
-
Is the maximum duration 30 seconds?
-
With long text, the voice timbre is inconsistent and does not stay fixed.
-
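Hi, I'd like to ask how speakers are added under the data path. Was a VQ Encoder used to convert the speaker's audio into an embedding that was then saved as a .pt file? Or is there some other method? Thanks~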
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
It is quite obvious that you've bought your github stars and forks.
A project no more than 4 days old, having exactly 10k stars and almost exactly 1000 forks? Come on guys, it's a disgrace that you h…