speech-text-pretraining Search Results

182 results
for speech-text-pretraining

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

lucidrains/spear-tts-pytorch #15

EOS token not predicted while training from scratch

I am currently training S1 from scratch as described in the paper as an ablation study. The paper states that the authors use a decoder only architecture and a 12-layer transformer as described in th…

Kodhandarama updated 8 months ago
3
modelscope/FunASR #1599

最新版funasr,按照教程fineturn 模型报错：forward() missing 4 required pos…

## 🐛 Bug：按照教程fineturn 模型报错：forward() missing 4 required positional arguments: 'speech', 'speech_lengths', 'text', and 'text_lengths' ### To Reproduce 按照教程https://github.com/alibaba-damo-academ…

Xsx93 updated 4 months ago
2
modelscope/FunASR #1935

No module named 'funasr.datasets.ms_dataset'

#### What is your question? 进行快速训练的时候出现了报错 ModuleNotFoundError Traceback (most recent call last) Cell In[21], line 10 7 from modelscope.trainers import build_traine…

Gin-Only updated 2 months ago
1
lwang114/UnsupTTS #1

applying on large single language dataset

Hello, I am trying to apply this procedure on a very large(900+) ours of speech. The text corpora is equally large, because the speech is theoretically transcrived but being newspaper scrape results i…

albluc24 updated 2 years ago
2
gitmylo/audio-webui #176

Installation Issue.

**Describe the bug** Audio-Webui does not install the requirements properly, precisely on audiolm, saying it failed to install. **To Reproduce** Steps to reproduce the behavior: 1. Go to 'audio-…

PericoSpart updated 1 month ago
3
facebookresearch/fairseq #4817

Outdated instructions to reproduce LID results with XLS-R

## 🐛 Bug I am trying to reproduce the results of the Language Identification task with the XLS-R model on the Voxligua107 dataset, but following the [current instructions](https://github.com/facebo…

urinieto updated 1 year ago
2
flashlight/text #86

NameError: name 'CriterionType' is not defined

I am trying to decode a fine-tuned ASR model,fine-tuned using the [vakyansh](https://github.com/Open-Speech-EkStep/vakyansh-wav2vec2-experimentation) toolkit using the [this](https://github.com/Open-S…

mukherjeesougata updated 6 months ago
1
heatz123/naturalspeech #10

Any plan of Reproducing mix-phoneme BERT ?

Hi there, First of all, I want to thank you for your great work. It has been incredibly helpful for us. However, I have noticed that you are not using the same phoneme encoder structure as in th…

TinaChen95 updated 1 year ago
3
facebookresearch/fairseq #3333

Recipe to use freshly released streaming models (Augmented-m…

- fairseq Version : master - PyTorch Version 1.7 - OS (e.g., Linux): Linux - How you installed fairseq : git clone - Python version: 3.8.5 - CUDA/cuDNN version: 10.2 - GPU models and conf…

vpellegrain updated 2 years ago
5
microsoft/SpeechT5 #56

pretrain loss

Excuse me, what value does my pre-training loss reach, can I start fintune tts? i found my finued tts model can generate a mel-spectrom but diffrent to ori mel-spectrom very much。 Is this due to…

MarsMeng1994 updated 7 months ago
4

上一页 1...1 2 3 4 5 6 7...19 下一页

182 results for speech-text-pretraining

182 results
for speech-text-pretraining