erew123 / alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, and wav file maintenance. It can also be used with 3rd-party software via JSON calls.
GNU Affero General Public License v3.0

Remember settings for dataset creation and training #203

Closed RenNagasaki closed 2 months ago

RenNagasaki commented 2 months ago

Please do me the favour of saving these settings. I'm having to train a shitload of voices, and having to refill all these fields each time is killing me. 🤣

image

erew123 commented 2 months ago

I'll see what I can do in the next version.

For now, if you want to hard-code your own values, you can edit the defaults in finetune.py:

The learning rate is on line 1447 (value=5e-6),

and the others are on lines 1142 to 1165:

    parser.add_argument(
        "--num_epochs",
        type=int,
        help="Number of epochs to train. Default: 10",
        default=10,
    )
    parser.add_argument(
        "--batch_size",
        type=int,
        help="Batch size. Default: 4",
        default=4,
    )
    parser.add_argument(
        "--grad_acumm",
        type=int,
        help="Grad accumulation steps. Default: 1",
        default=1,
    )
    parser.add_argument(
        "--max_audio_length",
        type=int,
        help="Max permitted audio size in seconds. Default: 11",
        default=11,
    )
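
Until settings are remembered natively, one possible workaround is to keep your preferred values in a small JSON file and use them as the argparse defaults. The following is only a rough sketch, not how finetune.py actually works: the file name `finetune_settings.json` and the helpers `load_defaults`, `save_settings` and `build_parser` are hypothetical, and only the four arguments shown above are covered (the learning rate is set elsewhere and is not handled here).

```python
import argparse
import json
from pathlib import Path

# Hypothetical settings file; this is NOT something AllTalk creates or reads.
SETTINGS_FILE = Path("finetune_settings.json")

# Built-in defaults, copied from the argparse snippet above.
BUILTIN_DEFAULTS = {
    "num_epochs": 10,
    "batch_size": 4,
    "grad_acumm": 1,
    "max_audio_length": 11,
}


def load_defaults(path=SETTINGS_FILE):
    """Return the built-in defaults, overridden by any values saved in `path`."""
    defaults = dict(BUILTIN_DEFAULTS)
    if path.exists():
        try:
            saved = json.loads(path.read_text(encoding="utf-8"))
        except (json.JSONDecodeError, OSError):
            saved = {}  # Unreadable or corrupt file: fall back to built-ins.
        if isinstance(saved, dict):
            # Ignore keys we do not know about.
            defaults.update(
                {k: v for k, v in saved.items() if k in BUILTIN_DEFAULTS}
            )
    return defaults


def save_settings(values, path=SETTINGS_FILE):
    """Write the known settings from `values` to `path` as JSON."""
    known = {k: values[k] for k in BUILTIN_DEFAULTS if k in values}
    path.write_text(json.dumps(known, indent=2), encoding="utf-8")


def build_parser(defaults):
    """Build a parser whose defaults come from `defaults`."""
    parser = argparse.ArgumentParser()
    for name, value in defaults.items():
        parser.add_argument(
            f"--{name}", type=int, default=value, help=f"Default: {value}"
        )
    return parser
```

With something like this, you could call `build_parser(load_defaults()).parse_args()` at startup and `save_settings(vars(args))` afterwards, so the values used in one run become the defaults for the next.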

Thanks

RenNagasaki commented 2 months ago

Love that! Thanks! ❤️