wavlm Search Results - Githubissues

287 results
for wavlm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #102999

Preserve weight_g/weight_v accessors on new weight_norm

### 🐛 Describe the bug Parametrizations don't let you control what the original parameters are called; they're always original0, original1, etc. For weight_norm, this new naming is a bit obtuse; th…

ezyang updated 2 months ago
15
OlaWod/FreeVC #79

poor performance on seen-to-unseen task while finetuning on …

Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoint…

rgenai updated 7 months ago
2
jasonppy/VoiceCraft #130

Speaker Similarty

Can you please provide codes and WAVLM-TDCNN model weights for calculating speaker similarity score?

QajikHakobyan updated 1 month ago
3
microsoft/Swin-Transformer #170

Swin Transformer is added to HuggingFace Transformers

Hi Swin Transformer team, We've recently added Swin Transformer to HuggingFace Transformers: https://huggingface.co/docs/transformers/master/en/model_doc/swin. All checkpoints are on the hub: h…

NielsRogge updated 2 years ago
2
pyannote/pyannote-audio #1727

3.3 dependencies

### Tested versions - 3.3 ### System information macOS, m1 ### Issue description Installing the most recent 3.3 version, trying out the new pixit pipeline i get the following errors (after downgr…

faroit updated 2 weeks ago
2
YoungSeng/UnifiedGesture #6

代码复现问题

在【用Trinity和ZEGGS数据集进行训练时，train.py文件】方面遇到了一些困难， ![image](https://github.com/YoungSeng/UnifiedGesture/assets/37477030/7a138d2f-cc8f-4701-afea-32c40b4a3167) 其中loss的值一直为“nan”，不理解为什么会是这个值

YoungSeng updated 4 months ago
33
OlaWod/FreeVC #5

好奇一个问题

您好，拜读了论文，想了解一下，模型的参数数量和运行时间大概是多久呢？或者说转换一秒的语音在3090显卡上需要多久的运算时间呢。

guoyingying432 updated 1 year ago
3
bshall/knn-vc #37

Question about the Used Hardware

Thanks for the great work. Knn-VC produces great results without even the need of training. One thing I noticed is, I canned use more than 3 minutes of refernce audio. If I use like 5 minutes of au…

CCMaure updated 1 month ago
1
espnet/espnet #5783

How to extract voice embeddings?

Hello. I have speech recordings in wav files, about 1-5 minutes each. How do we extract embeddings using the `espnet/voxcelebs12_ecapa_wavlm_joint` SOTA model? Documentation is overcomplicated.…

maxpain updated 1 month ago
6
coqui-ai/TTS #3797

[Bug] tts.tts_with_vc_to_file cannot use cpu

### Describe the bug Similar to #3787, but also when running `xtts_v2` model with voice cloning (vocoder model), using `device='cpu'` results to the following error: ``` RuntimeError: CUDA error: …

pieris98 updated 1 week ago
2

上一页 1...3 4 5 6 7 8 9...29 下一页

287 results for wavlm

287 results
for wavlm