Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI
https://www.camb.ai
GNU Affero General Public License v3.0

Where is the output saved? #23

Closed by 4DeepThought2 3 months ago

4DeepThought2 commented 3 months ago

I feel like an idiot here. Running this on Windows (WSL with conda). Where in the heck does the audio file output get saved?

nolanblew commented 3 months ago

The output doesn't seem to be saved anywhere - just in memory.

You can save the output by first importing soundfile:

import soundfile as sf

Then at the end of the script, put:

sf.write('./output.wav', output_audio, sr) # replace output.wav with your output path if you want to change it

If, like me, you have trouble with very short or empty audio, make sure the reference transcript is filled out correctly, or set deep_clone to False (though in my experiments, the results with deep_clone set to False were nowhere close to the reference audio).
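If installing soundfile is not an option, the same save step can be sketched with Python's standard-library wave module. This is only a minimal sketch and not part of MARS5: save_wav is a hypothetical helper, it assumes output_audio is a mono 1-D sequence of floats roughly in [-1.0, 1.0], and the sine tone at the bottom merely stands in for real model output.

```python
import math
import struct
import wave


def save_wav(path, samples, sample_rate):
    """Write mono float samples in [-1.0, 1.0] to a 16-bit PCM WAV file.

    Hypothetical helper for illustration; returns the number of samples written.
    """
    # Torch tensors and NumPy arrays both expose .tolist(); plain lists pass through.
    if hasattr(samples, "tolist"):
        samples = samples.tolist()
    # Clip to the valid range, then scale to signed 16-bit integers.
    pcm = [int(max(-1.0, min(1.0, s)) * 32767) for s in samples]
    with wave.open(path, "wb") as f:
        f.setnchannels(1)            # mono
        f.setsampwidth(2)            # 2 bytes = 16 bits per sample
        f.setframerate(sample_rate)  # e.g. the sr used when loading the reference
        f.writeframes(struct.pack(f"<{len(pcm)}h", *pcm))
    return len(pcm)


if __name__ == "__main__":
    # Stand-in for output_audio: one second of a 440 Hz tone at 24 kHz.
    sr = 24000
    tone = [0.5 * math.sin(2 * math.pi * 440 * n / sr) for n in range(sr)]
    save_wav("output.wav", tone, sr)
```

In the actual script, the call would be save_wav('./output.wav', output_audio, sr) at the end, in place of the sf.write line. A relative path like ./output.wav is resolved against the directory the script is run from.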

akshhack commented 3 months ago

Thanks for the answer @nolanblew !