Open WhiteNightMo opened 5 days ago
It seems like the larget len is too long, you can specify the appropriate target length yourself.
It seems like the larget len is too long, you can specify the appropriate target length yourself.
I tried to change target_len to 8, but the output audio was missing the first 3 words and was slow overall. When I changed it to 10, the output audio read everything, but the speed was really slow. 10s.zip
as missing the first 3 words and was slow overall. When I changed it to 10, the output audio read everything, but the speed was really slow.
请问解决了吗
as missing the first 3 words and was slow overall. When I changed it to 10, the output audio read everything, but the speed was really slow.
请问解决了吗
没呢,倒腾不出来
as missing the first 3 words and was slow overall. When I changed it to 10, the output audio read everything, but the speed was really slow.
请问解决了吗
没呢,倒腾不出来
调不出来,要么太快要么胡说八道
as missing the first 3 words and was slow overall. When I changed it to 10, the output audio read everything, but the speed was really slow.
请问解决了吗
没呢,倒腾不出来
调不出来,要么太快要么胡说八道
难顶,我是要么太慢要么胡说八道
as missing the first 3 words and was slow overall. When I changed it to 10, the output audio read everything, but the speed was really slow.
请问解决了吗
没呢,倒腾不出来
调不出来,要么太快要么胡说八道
难顶,我是要么太慢要么胡说八道
我还有背景音
有人解决了吗? Anybody fixed this?
your prompt audio and prompt text are not matched completely
Problem Overview
I modified this file
models/tts/maskgct/maskgct_inference.py
, changes are as follows:Run command:
My original file: 5s.zip
Output file: generated_audio7.zip