Open Rongjiehuang opened 3 years ago
Hi, I tried this and found that the trick does not solve the problem. Because the synthesized audio varies randomly between two synthesis runs, the subtraction can overshoot. For example, at some position a clearer segment (0.02, 0.05, 0.06) minus a bias (0.05, 0.05, 0.02) gives (-0.03, 0, 0.04), which means the first position gets worse.
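The arithmetic in the example above can be reproduced with a few lines of NumPy. This is only a sketch of the numbers quoted in the comment: the array names are illustrative, and the flooring step at the end is a common spectral-subtraction practice added here as an assumption, not something proposed in this thread.

```python
import numpy as np

# Values taken from the comment above; names are illustrative only.
clearer = np.array([0.02, 0.05, 0.06])  # segment from one synthesis run
bias = np.array([0.05, 0.05, 0.02])     # bias estimated from another run

# Plain subtraction: the first element overshoots below zero.
diff = clearer - bias

# Assumption (not from the thread): clamp negative results to zero,
# as is often done in spectral subtraction, to limit the overshoot.
floored = np.maximum(diff, 0.0)

print(diff)
print(floored)
```

Flooring removes the negative value but does not recover the information lost in the first position, so it limits the damage rather than fixing the underlying mismatch between runs.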
In my case, it does solve the checkerboard artifact problem. You could try adding some lower-quality speech to the training data, such as aishell3: after I combined the biaobei data with aishell3 for training, the problem went away. You can also try applying the μ-law (u-law) algorithm and normalization separately in each band to fix the problem.
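A minimal sketch of the last suggestion, per-band normalization followed by μ-law companding, might look like the following. It assumes the sub-band signals are already available as an array of shape `(n_bands, n_samples)`; the helper names, `mu=255`, and peak normalization are assumptions chosen for illustration, not the setup used in this repo.

```python
import numpy as np

def mu_law_encode(x, mu=255.0):
    """Compress values in [-1, 1] with the mu-law curve."""
    return np.sign(x) * np.log1p(mu * np.abs(x)) / np.log1p(mu)

def mu_law_decode(y, mu=255.0):
    """Invert mu_law_encode."""
    return np.sign(y) * np.expm1(np.abs(y) * np.log1p(mu)) / mu

def normalize_bands(bands, eps=1e-8):
    """Scale each band by its own peak so every band spans roughly [-1, 1]."""
    peaks = np.max(np.abs(bands), axis=1, keepdims=True)
    return bands / (peaks + eps), peaks

# Hypothetical sub-band signals whose levels differ by orders of magnitude.
rng = np.random.default_rng(0)
scales = np.array([[1.0], [0.1], [0.01], [0.001]])
bands = rng.standard_normal((4, 1000)) * scales

normed, peaks = normalize_bands(bands)
encoded = mu_law_encode(normed)                    # training target per band
decoded = mu_law_decode(encoded) * (peaks + 1e-8)  # back to original scale
```

Normalizing each band on its own keeps quiet high-frequency bands from being dominated by the loud low band, and the companding step then spends more resolution on small amplitudes within each band.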
Hi author, I found the note "the generated audio has interference at a specific frequency" in this repo. I have run into a straight line at a specific frequency while developing a similar multiband architecture, and I wonder whether this is the phenomenon you mentioned. Do you have any advice or solutions? Thanks.
Hi, I have also run into a straight line at a specific frequency while developing a similar multiband architecture, for example multiband Mel-Gan. Do you have a trick that solves it now?
There are three main general approaches for these constant lines:
Is it better than hifigan?
@Rongjiehuang Thanks, the last piece of advice works for me!