ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
https://0nutation.github.io/SpeechTokenizer.github.io/
Apache License 2.0
466 stars 40 forks source link

Cross-lingual #11

Open coding-sharks opened 4 months ago

coding-sharks commented 4 months ago

Hello, I used the checkpoint file you trained with librispeech to infer the Chinese audio and it still works well. Is that what you expected? Because your dataset doesn't seem to use Chinese, only English data.

UkiTenzai commented 3 weeks ago

I'm the same