ZhangXInFD SpeechTokenizer issues

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

https://0nutation.github.io/SpeechTokenizer.github.io/

Apache License 2.0

466 stars 40 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Discriminator weights

#23 LaurinmyReha opened 4 days ago
0
Question about size when training

#22 LiuMY13 opened 6 days ago
0
请问一下semantic distillation在哪个py文件中有体现呀，没找到呢

#21 JoyceMind opened 1 week ago
0
用readme里面的示例代码进行语音的重构，发现重构wav的采样点数量和原始wav的采样点数量不一样

#20 JoyceMind opened 1 week ago
0
请问用这个模型能提取出表征音频的连续向量吗？

#19 JoyceMind opened 1 week ago
0
a bug in EuclideanCodebook

#18 krgy12138 opened 1 month ago
0
bitrate of pretrained models

#17 gen35 opened 1 month ago
1
Can we generate audio from just semantic tokens?

#16 macabdul9 opened 2 months ago
1
Ask for number range of training loss.

#15 liangpenglong opened 2 months ago
0
changing hubert model to another model

#14 acul3 opened 3 months ago
0
10 batchsize = 63.72G???(All File are 3s wav)

#13 coding-sharks opened 3 months ago
1
ImportError: cannot import name 'split_torch_state_dict_into_shards' from 'huggingface_hub'

#12 ehosseiniasl opened 4 months ago
2
Cross-lingual

#11 coding-sharks opened 4 months ago
1
zero grad issus in encodec?

#10 yuzuda283 opened 4 months ago
2
Add example.py

#9 karthik19967829 closed 5 months ago
0
distill loss weight?

#8 yuzuda283 closed 4 months ago
1
what is the input when inference for encoding?

#7 Edwardmark closed 5 months ago
2
Could you kindly share the training code?

#6 zhenye234 closed 4 months ago
2
How to deal with the integer values of RVQ

#5 phdshliang opened 7 months ago
1
About HuBERT unit

#4 0417keito closed 4 months ago
5
Fix typo in README.md

#3 eltociear closed 1 year ago
1
Training process?

#2 fmac2000 closed 12 months ago
9
Update README.md

#1 KeiKinn closed 1 year ago
1