-
I am currently training S1 from scratch as described in the paper as an ablation study.
The paper states that the authors use a decoder-only architecture and a 12-layer transformer as described in th…
-
## 🐛 Bug: Fine-tuning the model by following the tutorial raises an error: `forward() missing 4 required positional arguments: 'speech', 'speech_lengths', 'text', and 'text_lengths'`
### To Reproduce
Following the tutorial at https://github.com/alibaba-damo-academ…
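A `TypeError` of this kind generally means `forward()` was invoked without the inputs it declares. Below is a minimal sketch in plain Python, assuming a hypothetical `ToyModel` class (not the actual model from the tutorial) whose `forward` takes the four arguments named in the message:

```python
# Hypothetical stand-in for the real model; only the forward() signature
# mirrors the argument names from the error message.
class ToyModel:
    def forward(self, speech, speech_lengths, text, text_lengths):
        return len(speech), len(text)


model = ToyModel()

# Calling forward() with no inputs reproduces the same kind of TypeError.
try:
    model.forward()
except TypeError as exc:
    error_message = str(exc)
    print(error_message)

# Passing every declared field by keyword avoids it.
batch = {
    "speech": [0.1, 0.2, 0.3],
    "speech_lengths": [3],
    "text": [5, 7],
    "text_lengths": [2],
}
result = model.forward(**batch)
```

In a real fine-tuning setup, one plausible cause is a batch whose keys do not match these argument names. That is an assumption here, since the original report is truncated.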
-
#### What is your question?
An error occurred while running quick training:
ModuleNotFoundError Traceback (most recent call last)
Cell In[21], line 10
7 from modelscope.trainers import build_traine…
-
Hello, I am trying to apply this procedure to a very large (900+ hours) speech corpus. The text corpus is equally large, because the speech is theoretically transcribed, but being newspaper scrape results i…
-
**Describe the bug**
Audio-Webui does not install the requirements properly; specifically, audiolm fails to install.
**To Reproduce**
Steps to reproduce the behavior:
1. Go to 'audio-…
-
## 🐛 Bug
I am trying to reproduce the results of the Language Identification task with the XLS-R model on the VoxLingua107 dataset, but following the [current instructions](https://github.com/facebo…
-
I am trying to decode an ASR model fine-tuned with the [vakyansh](https://github.com/Open-Speech-EkStep/vakyansh-wav2vec2-experimentation) toolkit using [this](https://github.com/Open-S…
-
Hi there,
First of all, I want to thank you for your great work. It has been incredibly helpful for us.
However, I have noticed that you are not using the same phoneme encoder structure as in th…
-
- fairseq Version: master
- PyTorch Version: 1.7
- OS (e.g., Linux): Linux
- How you installed fairseq: git clone
- Python version: 3.8.5
- CUDA/cuDNN version: 10.2
- GPU models and conf…
-
Excuse me, what value should my pre-training loss reach before I can start fine-tuning TTS?
I found that my fine-tuned TTS model can generate a mel-spectrogram, but it differs greatly from the original mel-spectrogram.
Is this due to…