coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
http://coqui.ai
Mozilla Public License 2.0
34.84k stars 4.24k forks source link

[Bug] Voice conversion converting speaker of the `source_wav` to the speaker of the `target_wav` #2493

Closed ziyaad30 closed 1 year ago

ziyaad30 commented 1 year ago

Describe the bug

tts = TTS(model_name="voice_conversion_models/multilingual/vctk/freevc24", progress_bar=False, gpu=True)
tts.voice_conversion_to_file(source_wav="my/source.wav", target_wav="my/target.wav", file_path="output.wav")
(coqui) C:\Users\User\Desktop\coqui\TTS>python test.py
 > voice_conversion_models/multilingual/vctk/freevc24 is already downloaded.
Traceback (most recent call last):
  File "test.py", line 4, in <module>
    tts = TTS(model_name="voice_conversion_models/multilingual/vctk/freevc24", progress_bar=False, gpu=True)
  File "C:\Users\User\Desktop\coqui\TTS\TTS\api.py", line 277, in __init__
    self.load_tts_model_by_name(model_name, gpu)
  File "C:\Users\User\Desktop\coqui\TTS\TTS\api.py", line 368, in load_tts_model_by_name
    self.synthesizer = Synthesizer(
  File "C:\Users\User\Desktop\coqui\TTS\TTS\utils\synthesizer.py", line 86, in __init__
    self._load_tts(tts_checkpoint, tts_config_path, use_cuda)
  File "C:\Users\User\Desktop\coqui\TTS\TTS\utils\synthesizer.py", line 145, in _load_tts
    if self.tts_config["use_phonemes"] and self.tts_config["phonemizer"] is None:
  File "C:\Users\User\anaconda3\envs\coqui\lib\site-packages\coqpit\coqpit.py", line 614, in __getitem__
    return self.__dict__[arg]
KeyError: 'use_phonemes'

Expected behavior

No response

Logs

No response

Environment

TTS Version 0.13.0

Additional context

No response

erogol commented 1 year ago

https://github.com/coqui-ai/TTS/pull/2495 fixes it

nateraw commented 1 year ago

Hey @erogol that PR doesn't fix this issue

erogol commented 1 year ago

@nateraw the same code above works on my side with the PR