Closed uni-saurabh-vyas closed 4 months ago
Since the error message says something about a tensor with 0 elements, I suspect that there might be a problem with the audio you loaded. Can you try printing the shape of the audio that goes into the resample function inside of the tts.set_utterance_embedding function?
the audio file seems to be there
Toucan/InferenceInterfaces/ToucanTTSInterface.py(108)set_utterance_embedding() -> if len(wave.shape) > 1: # oh no, we found a stereo audio! (Pdb) wave.shape (83498,)
Channels : 1 Sample Rate : 8000 Precision : 16-bit Sample Encoding: 16-bit Signed Integer PCM
Nevermind, I think one of the files had issue/empty during iteration, adding try except block solved the issue.
Logs
Code:
Any help would be appreciated.