issues
search
microsoft
/
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
MIT License
1.09k
stars
113
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
What languages are supported? How to specify a language?
#82
secsilm
opened
1 week ago
0
SpeechUT does not have a link for download
#81
world1tree
opened
4 weeks ago
2
What's the model_path and data_name on inference code?
#80
YepJin
opened
1 month ago
1
Confusion/Question about SpeechT5SpeechDecoderPostnet output
#79
Student204161
opened
2 months ago
0
Error in loading WavLLM model
#78
rishabh004-ai
opened
2 months ago
9
Single Task Training
#77
yangjiabupt
closed
2 months ago
1
WavLLM checkpoint
#76
ming024
opened
2 months ago
5
ASR fine-tuning loss goes to zero after several epochs
#75
yunigma
closed
2 months ago
2
extract transorformer layer feature
#74
zbpjlc
opened
4 months ago
2
Does the pre-trained model for hidden unit tokenizer use speaker embeddings?
#73
Kodhandarama
opened
4 months ago
0
What is the time taken to converge for the hidden unit tokenizer?
#72
Kodhandarama
opened
4 months ago
0
Link to train_960.tsv is broken
#71
Kodhandarama
opened
5 months ago
0
"SpeechT5" on Android OS
#70
taeyeonlee
opened
5 months ago
0
British English TTS model
#69
omega3
closed
2 months ago
1
Text feature extraction using SpeechLM
#68
wonjune-kang
opened
6 months ago
0
Baseline implementation
#67
ussenuk
opened
7 months ago
1
How to setting language when do S2T
#66
nhha1602
opened
7 months ago
1
是否支持中文转语音?
#65
xxm1668
opened
7 months ago
4
The size of tensor a (674) must match the size of tensor b (600) at non-singleton dimension 1
#64
poojitharamachandra
opened
8 months ago
1
SpeechT5 - TTS - Tokenizer adding `▁` token between newly added Vietnamese characters
#63
GinUTE
closed
6 months ago
1
ASR SpeechT5 training - model predicts same output for different inputs
#62
L7uan
opened
9 months ago
0
Is end-to-end S2ST possible with Speecht5?
#61
elia-ashraf
opened
9 months ago
0
Generate the N-best (top few) hypotheses
#60
cyfer0618
opened
10 months ago
0
Reproduce ASR experiment results in Hugging Face
#59
jjyaoao
closed
11 months ago
0
Voice Conversion - Error with Some Mono, 16kHz, 16bit Audio
#58
fabiocat93
opened
11 months ago
3
Getting TTS output voice close to the training data - Finetuning on different language
#57
Srija616
opened
11 months ago
2
pretrain loss
#56
MarsMeng1994
opened
12 months ago
4
Bump scipy from 1.5.4 to 1.10.0 in /VATLM/vat_hubert
#55
dependabot[bot]
opened
12 months ago
0
VATLM: Error when loading finetuned checkpoints for infer_s2s
#54
naraysa
opened
1 year ago
0
Pretraining SpeechT5, meet problems about batch_sampler in multitask_dataset. Should I get idx and bin files of data one by one (wav) or get all of them in only two file(idx and bin each have one)
#53
Lemonaddeee
opened
1 year ago
0
SpeechUT inference error in en_fr checkpoint
#52
ytf-philp
opened
1 year ago
1
Using SpeechT5 Large for TTS
#51
imranmaj
opened
1 year ago
0
SpeechT5: extracting Chinese speaker embedding
#50
QQ-777777
opened
1 year ago
6
SpeechT5-tts fine-tuned on Chinese
#49
qlmbeck
opened
1 year ago
4
add link to Hugging Face fine-tuning example
#48
hollance
closed
1 year ago
1
The link for Prosody-SpeechT5 in the Readme is dead/404
#47
svantana
closed
1 year ago
2
SpeechLM
#46
blueblue-bubble
closed
1 year ago
2
SpeechT5:how much epoch is set
#45
QQ-777777
closed
1 year ago
5
how to pause between two words ?
#43
hulk10425
opened
1 year ago
2
how to fine tune sid on pretrained model?
#42
haha010508
closed
1 year ago
11
hydra fine-tunning for speechT5?
#41
ramonsanabria
opened
1 year ago
0
[SpeechLM] About phoneme tokenizer in detail?
#40
yuseungwoo
closed
1 year ago
1
reproduction steps for inference
#39
ghost
opened
1 year ago
2
Pretrain SpeechT5 on my own dataset
#38
hungker
closed
1 year ago
3
Missing speecht5 task
#37
maximerenou
closed
1 year ago
1
SpeechT5 Speech Enhancement
#36
avramandrei
opened
1 year ago
2
Fine-tunning on Hugging Face
#35
ramonsanabria
opened
1 year ago
1
SpeechUT inference and fine-tune problem
#34
ytf-philp
closed
1 year ago
3
add Hugging Face links
#33
hollance
closed
1 year ago
2
add SID in SpeechT5
#32
mechanicalsea
closed
1 year ago
1
Next