EveryVoiceTTS / EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language
https://docs.everyvoice.ca

Rename `synthesize`'s sub commands to `X-to-audio` for a more intuitive name #181

Closed SamuelLarkin closed 10 months ago

SamuelLarkin commented 10 months ago

`everyvoice synthesize`'s subcommands should be renamed to `X-to-audio` to make them clearer, more general, and less revealing of the underlying implementation details.

everyvoice synthesize --help

 Usage: everyvoice synthesize [OPTIONS] COMMAND [ARGS]...

 ┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
 ┃                                   Synthesize Help                                   ┃
 ┗━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┛

  • text-to-spec --- this is the most common model to run for performing normal speech
    synthesis.
  • spec-to-wav --- this is the model that turns your spectral features into audio. this
    type of synthesis is also known as copy synthesis and unless you know what you are
    doing, you probably don't want to do this.

╭─ Options ─────────────────────────────────────────────────────────────────────────────╮
│ --help  -h        Show this message and exit.                                         │
╰───────────────────────────────────────────────────────────────────────────────────────╯
╭─ Commands ────────────────────────────────────────────────────────────────────────────╮
│ text-to-wav  Given some text and a trained model, generate some audio. i.e. perform   │
│              typical speech synthesis                                                 │
│ spec-to-wav  Given some Mel spectrograms and a trained model, generate some audio.    │
│              i.e. perform copy synthesis                                              │
╰───────────────────────────────────────────────────────────────────────────────────────╯
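
To illustrate the rename discussed above, here is a minimal sketch of how `X-to-audio` subcommand names could be introduced while still accepting the current names. It uses Python's standard `argparse` rather than EveryVoice's actual CLI code, and the program name `synthesize-sketch`, the `canonical` attribute, and the `resolve` helper are all hypothetical names chosen for this example.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a toy CLI with format-neutral subcommand names (hypothetical)."""
    parser = argparse.ArgumentParser(prog="synthesize-sketch")
    sub = parser.add_subparsers(dest="command", required=True)

    # New name first; the current name is kept as an alias so that
    # existing scripts keep working during a transition period.
    text_cmd = sub.add_parser(
        "text-to-audio",
        aliases=["text-to-wav"],
        help="Given some text and a trained model, generate some audio.",
    )
    text_cmd.set_defaults(canonical="text-to-audio")

    spec_cmd = sub.add_parser(
        "spec-to-audio",
        aliases=["spec-to-wav"],
        help="Given some Mel spectrograms and a trained model, generate some audio.",
    )
    spec_cmd.set_defaults(canonical="spec-to-audio")
    return parser


def resolve(argv: list[str]) -> str:
    """Return the canonical subcommand name for the given arguments."""
    return build_parser().parse_args(argv).canonical
```

With this sketch, `resolve(["text-to-wav"])` and `resolve(["text-to-audio"])` both return `"text-to-audio"`, so the old spelling maps onto the new one. Whether to keep aliases at all is a design choice; dropping them gives a cleaner help screen at the cost of breaking existing invocations.
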

marctessier commented 10 months ago

This goes hand in hand with issue 182: https://github.com/roedoejet/EveryVoice/issues/182. I will close this one.

Good point. Right now we only produce "wav" sound files, but technically we could do mp3, ogg, whatever... Definitely something to think about for issue 182; I will add that comment there.