ZhangXInFD/SpeechTokenizer
This is the code for SpeechTokenizer, presented in the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are available at https://0nutation.github.io/SpeechTokenizer.github.io/
Apache License 2.0
466 stars
40 forks
issues
Discriminator weights
#23
LaurinmyReha
opened
4 days ago
0
Question about size when training
#22
LiuMY13
opened
6 days ago
0
Which .py file implements semantic distillation? I couldn't find it.
#21
JoyceMind
opened
1 week ago
0
Reconstructing speech with the README example code yields a wav with a different number of samples than the original wav
#20
JoyceMind
opened
1 week ago
0
Can this model extract continuous vectors that represent the audio?
#19
JoyceMind
opened
1 week ago
0
a bug in EuclideanCodebook
#18
krgy12138
opened
1 month ago
0
bitrate of pretrained models
#17
gen35
opened
1 month ago
1
Can we generate audio from just semantic tokens?
#16
macabdul9
opened
2 months ago
1
Asking for the numeric range of the training loss.
#15
liangpenglong
opened
2 months ago
0
changing hubert model to another model
#14
acul3
opened
3 months ago
0
10 batchsize = 63.72G??? (All files are 3s wav)
#13
coding-sharks
opened
3 months ago
1
ImportError: cannot import name 'split_torch_state_dict_into_shards' from 'huggingface_hub'
#12
ehosseiniasl
opened
4 months ago
2
Cross-lingual
#11
coding-sharks
opened
4 months ago
1
zero grad issue in encodec?
#10
yuzuda283
opened
4 months ago
2
Add example.py
#9
karthik19967829
closed
5 months ago
0
distill loss weight?
#8
yuzuda283
closed
4 months ago
1
what is the input when inference for encoding?
#7
Edwardmark
closed
5 months ago
2
Could you kindly share the training code?
#6
zhenye234
closed
4 months ago
2
How to deal with the integer values of RVQ
#5
phdshliang
opened
7 months ago
1
About HuBERT unit
#4
0417keito
closed
4 months ago
5
Fix typo in README.md
#3
eltociear
closed
1 year ago
1
Training process?
#2
fmac2000
closed
12 months ago
9
Update README.md
#1
KeiKinn
closed
1 year ago
1