Open HuuHuy227 opened 2 months ago
Any experiment give the result of pflow compare to vits2. Which is better?
Not sure. I tried on mandarin dataset. VITS2 is not good enough in speech pause and prosody, but pflow trained result is even worse. Is that the bad from "duration predictor and expension"?
Any experiment give the result of pflow compare to vits2. Which is better?