Open lesterphillip opened 1 year ago
Some updates:
It's a little hard to retrain since it takes several days, so it takes some time to get some findings.
I've tried more variations on improving this reimplementation (as in the checklist above), and the original version is still the best despite the unusual practices. So, it would just be better to keep it as it is right now.
On the other hand, the current issue is that compared to the original version, the synthesis has muffled-like sounds, as attached below. fastsvc_comparison.zip
More samples found here.
I don't think I can spend a lot of time on this project as it takes too long to train and I think it would just be better to try newer architectures.
Adding here some implementation improvements that I need to do courtesy of comments from @r9y9
I can probably add some samples to see how much improvement these make.