The current BigCodec configuration runs at 80 Hz, which is too high for my TTS task in terms of latency. Is it possible to change the configuration to use 2 codebooks and halve the frame rate (to 40 Hz)?
I've tried a low token rate with a single codebook, but the reconstruction CER is much higher than at 80 Hz, so I'd like to increase the number of codebooks instead.
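
To make the trade-off concrete, here is a rough sketch of the arithmetic I have in mind. The codebook size of 8192 is an assumption on my side (please check it against the actual config), and `codec_rates` is just a hypothetical helper, not part of BigCodec:

```python
import math

def codec_rates(frame_rate_hz, num_codebooks, codebook_size):
    """Return frames/s, tokens/s and bitrate for a VQ codec setup."""
    bits_per_token = math.log2(codebook_size)
    tokens_per_second = frame_rate_hz * num_codebooks
    return {
        "frames_per_second": frame_rate_hz,
        "tokens_per_second": tokens_per_second,
        "bitrate_bps": tokens_per_second * bits_per_token,
    }

# Assumed codebook size; verify against the released configuration.
CODEBOOK_SIZE = 8192

current = codec_rates(80, 1, CODEBOOK_SIZE)   # 80 Hz, single codebook
proposed = codec_rates(40, 2, CODEBOOK_SIZE)  # 40 Hz, two codebooks

print(current)
print(proposed)
```

Under that assumption both setups have the same bitrate, but the 40 Hz one has half as many frames per second. That only helps latency if the TTS model predicts both codebooks of a frame in a single step.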