I wonder if knn-vc could work for audio streams (say, in chunks of audio of 10-500ms) instead of whole audio files. Has this been explored? Could it work? I could not find any info online. I imagine that if the two successive audio chunks would split a phoneme in two this could cause problems?
Hey!
I wonder if knn-vc could work for audio streams (say, in chunks of audio of 10-500ms) instead of whole audio files. Has this been explored? Could it work? I could not find any info online. I imagine that if the two successive audio chunks would split a phoneme in two this could cause problems?